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£1 _ 

(see Rule 70.16 and Section 607 of the Ad 
These annexes consists of a total of /* sheets. 



sheets, including this cover sheet 



This report is also accompanied by ANNEXES, i.e. t sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 



This report contains indications and corresponding pages relating to the following items: 

I [x] Basis of the report 
II Q Priority 

III Q Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

IV Q Lack of unity of invention 

V M Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability, 
citations and explanations supporting such statement 

VI Q Certain documents cited 

VII ^] Certain defects in the international application 
VIII ST] Certain observations on the international application 
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VSy Fax: { + 49-89) 2399^465 


Authorized officer 
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I. Basis of the report 



1. This report has been drawn up on the basis of (Replacement sheets which have been furnished to the receiving 
Office in response to an invitation under Article 14 are referred to in this report as "originally filed" and are 
not annexed to the report since they do not contain amendments.): 



[ ] the international application as originally filed. 

, as originally filed, 
, filed with the demand, 
,, filed with the letter of 
, filed with the letter of 



[x] the description, pages 1-14, 

pages 

pages 

pages 



[x] the claims, Nos. , as originally filed, 

Nos. ; / as amended under Article 19, 

Nos. , filed with the demand, 

Nos. 1-23 filed with the letter of 15/08/97, 

Nos. . filed with the letter of , 

[x] the drawings, sheets/fig 1/4-4/4 , as originally filed, 

sheets/fig . filed with the demand, 

sheets/fig / filed with the letter of 

sheets/fig / filed with the letter of : 



2. The amendments have resulted in the cancellation of: 

[ ] the description, pages . . 

[ ] the claims, Nos. . 

[ ] the drawings, sheets/fig . 

3. [ ] This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 



4. Additional observations, if necessary: 
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V. Reasoned stateient under Article 35(2) with regard to novelty, inventive step and industrial applicability; 
citations and explanations supporting such stateient 



1. STATEMENT 



Novelty (N) Claims 1-23 YES 

Claims NO 

Inventive Step (IS) Claims 1-23 YES 

Claims NO 

Industrial Applicability (IA) Claims 1-21 YES 



Claims 22-23 (no assessment) NO 



2. CITATIONS AND EXPLANATIONS 

1) Reference is made to the following documents: 

Dl : Database Medline, File Server STN Karlsruhe, Ab- 
stract 77064466 

D2 : Abstracts of the 95th General Meeting of the Ameri- 
can Society for Microbiology, 21-25 May 1995, Wash- 
ington D.C USA, page 289, Abstract E-49. 

D3 : Biochemistry, vol, 33, 1994, pages 7014-7020 

D4 : WO-A-94/03615 

2) The present application appears to satisfy the criteri- 
ons set forth in Articles 33(2) and 33(3) PCT in respect 
of the prior art as defined in the regulations (Rule 
64(l)-(3) PCT), 

Dl discloses the purification of the toxin from 
Clostridium botulinum type F. However, Dl teaches immu- 
nization of guinea pigs with the toxoid obtained from 
said toxin. The vaccinated guinea pigs are resistent to 
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challenge by type F botulinal toxin. There is neither a 
disclosure nor suggestion of an immunization with a 
polypeptide free of toxoid. 

D2 discloses that the vaccination of humans with pure 
type F toxoid lead to the production of a protective se- 
rum antibody response. Here again, D2 is silent as to an 
immunization with a polypeptide which is free of toxoid. 

D3 discloses the preparation by recombinant techniques 
of a maltose binding protein ( MBP ) - tetanus toxin light 
chain (LC) fusion product, wherein residue Glu234 of the 
light chain is replaced by Ala, MBP being used as an af- 
finity tag to facilitate purification and subsequent 
isolation of free LC. It is shown that the Ala234-LC mu- 
tant lacks neurotoxicity in mice. However, it is clear 
that the application of the specific amino acid replace- 
ment taught by D3 is restricted to the tetanus toxin. 

D4 discloses the preparation of a fusion protein con- 
sisting of a polypeptide immunogen linked by a hinge 
fragment to an antigenic determinant of a pathogen or- 
ganism. As a preferred embodiment, D4 discloses as 
polypeptide immunogen the Tetanus toxin C fragment and 
as antigenic sequence, the Schistosoma mansoni P28 pro- 
tein, the latter being capable of being affinity puri- 
fied on a glutathione agarose matrix (see page 10, line 
6- -page 11, paragraph 2). The use of the fusion protein 
as a vaccine is also disclosed (see page 13, third 
paragraph- page 15, first paragraph; Example 5). It is 
not derivable from D4 whether the Tetanus toxin C frag- 
ment used in D4 is free of toxin activity, which is one 
of the requirements recited in the present claims. More- 
over, this requirement is not mentioned in D4 as regards 
the alternatives listed therein on page 5, lines 10-16. 
It therefore appears that even if the skilled man would 
have been tempted to apply the teaching of D4 to the 
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preparation of a fusion protein capable of protecting 
against a type F botulinum toxin, he would not have ar- 
rived at the claimed subject-matter, the above require- 
ment being not obviously derivable from D4. 

Claims 1-23 therefore appear to be novel and to imply an 
inventive step according to Articles 33(2) and 33(3) 
PCT. 

4) The priority documents pertaining to the present appli- 
cation were not available at the time of establishing 
this preliminary examination report. Hence, it is based 
on the assumption that all claims enjoy priority rights 
from the filing date of the priority document. The 
document "Biochemistry, vol. 34, Nov. 1995, pages 
15175-15181" cited in the international search report 
has therefore not been considered to be part of the 
prior art as defined in the regulations (Rule 64(1 )-( 3) 
PCT) . 



5) 



6) 



Claims 1-21 are susceptible of industrial applicability 
as defined in Article 33(4) PCT. 

For the assessment of present claims 22 and 23 on the 
question as to whether they are industrially applicable, 
no unified criteria exist in the PCT. The patentability 
can also be dependent upon the formulation of the 
claims. The EPO, for example, does not recognize as in- 
dustrially applicable the subject-matter of claims to 
the use of a compound in medical treatment, but will al- 
low, however, claims to a known compound for first use 
in medical treatment and the use of such a compound for 
the manufacture of a medicament for a new medical treat- 
ment. 
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VII. Certain defects in the international application 



The following defects in the form, or contents of the international application have been 



noted: 



1) Contrary to the requirements of Rule 5.1(a) (ii) pct, the 
relevant background art disclosed in document D4 is 'not 
mentioned in the description, nor is this document iden- 
tified therein. 

2) The description is not in conformity with the claims as 
required by Rule 5.1(a) (iii) pct. 
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VIII. Certain observations on the international application 



The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 

1) Claims 1, 2, 7, 17 and 19 do not meet the requirements 
of Article 6 PCT in that the matter for which protection 
is sought is defined in terms of the result to be 
achieved. In this instance, however, such a formulation 
is not allowable because it seems possible to define the 
subject-matter in more concrete terms, viz, in terms of 
how the result is to be achieved. In this respect, the 
only substantial support which can be found in the de- 
scription for these claims is represented by the amino 
sequences SEQ ID NO:l to SEQ ID NO: 4. 

2) Claims 7, 17 and 19 also lack clarity (Article 6 PCT) in 
that the essential technical feature that the 
polypeptide is free of toxoid (see page 3, line 19; page 
4, last line) is not recited in the claims. 

3) Claim 14 lacks clarity (Article 6 PCT) in that it does 
not recite the essential technical feature that the 
chromatography column is an affinity chromatography col- 
umn (see page 8, line 15). 
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According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C07K 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



DATABASE MEDLINE 

FILE SERVER STN KARLSRUHE 

ABSTRACT 77064466, 

HATHEWAY: "TOXOID OF CLOSTRIDIUM BOTULINUM 
TYPE F: PURIFICATION AND IMMUNOGENICITY 
STUDIES" 
XP002015940 

& APPLIED AND ENVIROMENTAL MICROBIOLOGY, 
(1976 FEB) 31 (2) 234-42. 
see abstract 

ABSTRACTS OF THE 95TH GENERAL MEETING OF 
THE AMERICAN SOCIETY FOR MICROBIOLOGY, 
21 - 25 May 1995, WASHINGTON D.C.USA, 
page 289 XP002015937 
MONTGOMERY ET AL: "EVALUATION OF 
BOTULINUM TYPE F VACCINE BY ELISA" 
see abstract E-49 

-/- 



1,2,12, 
22 



1,2,12, 
22 



0 



Further documenti arc listed in the continuation of box C. 



0 



Patent family members are listed in annex. 



* Special categories of cited documenti : 

"A' document defining the general state of the art which is not 

considered to be of particular relevance , 
*E* earlier document but published on or after the international 

filing date 

'L* document which may throw doubts on priority dairofs) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

*0* document referring to an oral disclosure, use, exhibition or 
other means 

*P* document published prior to the international filing date but 
later than the priority date claimed 



T later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X* document of particular relevance; the d aimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the daimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

*&" document member of the same patent family 



Date of the actual completion of the international search 

16 October 1996 



Date of mailing of the international search report 



0 «». 11. 96 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL • 2280 HV Rijswijk 
Td. ( + 31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 



Authorized officer 



Sitch, W 
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^Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



BIOCHEMISTRY, 
vol. 33, 1994. 

pages 7014-7020, XP002015938 
LI ET AL: "A SINGLE MUTATION IN THE 
RECOMBINANT LIGHT CHAIN OF TETANUS TOXIN 
ABOLISHES ITS PROTEOLYTIC ACTIVITY AND 
REMOVES THE TOXICITY SEEN AFTER 
RECONSTITUTE WITH NATIVE HEAVY CHAIN" 
see the whole document 

WO, A, 94 03615 (NEDEVA HOLDINGS B V ;KHAN 
MOHAMMED ANJAM (GB); HORMAECHE CARLOS ES) 
17 February 1994 

see page 10, paragraph 2 - page 11, 
paragraph 2 

DATABASE CHEMICAL ABSTRACTS 

FILE SERVER STN KARLSRUHE 

ABSTRACT 123:219673, 

MINT0N: "PHYSICAL CHARACTERIZATION OF 

CLOSTRIDIUM BOTULINUM NEUROTOXIN GENES" 

XP002015941 

& REPORT (1993), ORDER N0.AD-A272939 ... 
see abstract 

DATABASE CHEMICAL ABSTRACTS 
FILE SERVER STN KARLSRUHE 
ABSTRACT 119:21962 

MINTON: "PHYSICAL CHARACTERIZATION OF 
CLOSTRIDIUM BOTULINUM NEUROTOXIN GENES" 
XP002015942 

& REPORT (1992), ORDER N0.AD-A248904. . . 
see abstract 

SYSTEM. APPL.MICROBIOL., 

vol. 18, no. 1, May 1995, 

pages 23-31, XP000605363 

ELMORE ET AL: "NUCLEOTIDE SEQUENCE OF THE 

GENE CODING FOR PROTEOLYTIC (GROUP I) 

CLOSTRIDIUM BOTULINUM TYPE F NEUROTOXIN: 

GENEALOGICAL COMPARISON WITH OTHER 

CLOSTRIDIAL NEUROTOXINS" 

see the whole document 

W0.A.94 21684 (PHLSB ;NIBSC (GB); SESARDIC 

DOROTHEA (GB); CHAN WOON LING (GB); SH) 29 

September 1994 

see page 12; table 1 

see page 17, last paragraph-page 18, last 

paragraph 

see page 20; table 6 

"-/-- 



7,8,10, 

12,13, 

17-19,21 



7.8,10, 

12,13, 

17-19,21 



7,8,10, 
12,13, 
17-19, 
21-23 



1-23 



1-23 



1-23 
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Category * Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



p,x 



TRENDS IN BIOTECHNOLOGY, 

vol. 8, no. 4, April 1990, 

pages 88-93, XP000103110 

SASSENFELD: 'ENGINEERING PROTEINS FOR 

PURIFICATION" 

see the whole document 

BIOCHEMISTRY, 

vol. 34, November 1995, 

pages 15175-15181, XPO02015939 

ZHOU ET AL: "EXPRESSION AND PURIFICATION 

OF THE LIGHT CHAIN OF BOTULINUM NEUROTOXIN 

A: A SINGLE MUTATION ABOLISHES ITS 

CLEAVAGE OF SNAP-25 AND NEUROTOXICITY 

AFTER RECONSTITUTE WITH THE HEAVY CHAIN" 

see the whole document 



7,8,10, 
12,13, 
17-19, 
21-23 



7,8,10, 
12,13, 
17-19, 
21-23 
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(PCT Article 18 and Rules 43 and 44) 



Applicant's or agent's file reference 

GWS/ 18446 



FOR FURTHER 
ACTION 



see Notification of Transmittal of International Search Report 
(Form PCT/ISA/220) as well as, where applicable, item 5 below. 



International application No. 

PCT/GB 96/01409 



International filing dats(daylmonthlyear) 

12/06/1996 



(Earliest) Priority Date (dayjmonthjyear) 

12/06/1995 



Applicant 



MICROBIOLOGICAL RESEARCH AUTHORITY et al. 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the International Bureau. 



This International Search Report consists of a total of _ 



sheets. 



[)("| It is also accompanied by a copy of each prior art document cited in this report. 



1. [^] Certain claims were found unsearchable (see Box I). 

2. Q Unity of invention is lacking (see Box II). 

3. fvl The international application contains disclosure of a nucleotide and/or amino acid sequence listing and the 

international search was carried out on the basis of the sequence listing 

[y[\ filed with the international application. 

| | furnished by the applicant separately from the international application, 

I I but not accompanied by a statement to the effect that it did not include 
— matter going beyond the disclosure in the international application as filed. 

| [ Transcribed by this Authority 

4. With regard to the title, Q the text is approved as submitted by the applicant 

|Y1 the text has been established by this Authority to read as follows: 

Type F Botulinum toxin and use thereof 



5. With regard to the abstract, 



[Y] the text is approved as submitted by the applicant 

I I the text has been established, according to Rule 38.2(b), by this Authority as it appears in 

Box HI. The applicant may, within one month from the date of mailing of this International 
Search Report, submit comments to this Authority. 



6. The figure of the drawings to be published with the abstract is: 

Figure No. 4 [^] as suggested by the applicant 

| | because the applicant failed to suggest a figure. 

| [ because this figure better characterizes the invention. 



| | None of the figures. 
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Box I Observations where certain claims were found unsearchable (Continuation of item I of first sheet) 

This International Search Report has not been established in respect of certain claims under Article I7(2)(a) for the following reasons: 



1. X Claims Nos.: 22,23 

because they relate to subject matter not required to be searched by this Authority, namely: 

Remark: Although claims 22 and 23 are directed to a method of treatment 
of the human/animal body, the search has been carried out and based on 
the alleged effects of the compound/composition. 

2. M Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 

an extent that no meaningful International Search can be carried out, specifically: 



3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1. | [ As all required additional search fees were timely paid by the applicant, this International Search Report covers all 

searchable claims. 

2. | | As all searchable claims could be searches without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3. As only some of the required additional search fees were timely paid by the applicant, this International Search Report 

covers only those* claims for which fees were paid, specifically claims Nos.: 



4. | | No required additional search fees were timely paid by the applicant Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest [ | The additional search fees were accompanied by the applicant's protest 

| j No protest accompanied the payment of additional search fees. 
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A. CLASSIFICATION OF SUBJECT MATTER . . 

IPC 6 C12N15/31 C12N15/62 C07K14/33 A61K39/08 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C07K 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


DATABASE MEDLINE 


1,2,12, 




FILE SERVER STN KARLSRUHE 


22 




ABSTRACT 77064466, 






HATHEWAY: "TOXOID OF CLOSTRIDIUM BOTULINUM 






TYPE F: PURIFICATION AND IMMUNOGENICITY 






STUDIES" 






XP0O2O15940 






& APPLIED AND ENVIROMENTAL MICROBIOLOGY, 






(1976 FEB) 31 (2) 234-42. 






see abstract 




X 


ABSTRACTS OF THE 95TH GENERAL MEETING OF 


1,2,12, 




THE AMERICAN SOCIETY FOR MICROBIOLOGY, 


22 




21-25 May 1995, WASHINGTON D.C.USA, 






page 289 XP002015937 






MONTGOMERY ET AL: "EVALUATION OF 






BOTULINUM TYPE F VACCINE BY ELISA" 






see abstract E-49 
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Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



0 Special categories of cited documents : 

'A* document defining the general state of the art which is not 

considered to be of particular relevance 
"E" earlier document but published on or after the international 

filing date 

"L" document which may throw doubts on priority claimfs) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

'O* document referring to an oral disclosure, use, exhibition or 
other means 

*P' document published prior to the international filing date but 
later than the priority date claimed 



"T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

*X* document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

*Y" document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

*&" document member of the same patent family 



Date of the actual completion of the international search 

16 October 1996 


Date of mailing of the international search report 
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Type F Botulinum toxin and use thereof 

The present invention relates to type F botulinum toxin, to a fragment of type F 
botulinum neurotoxin, to production of the fragment by recombinant means and to a 
synthetic gene encoding the fragment. In particular, the invention relates to a novel 
polypeptide fragment capable of eliciting an immunological response that is protective 
against type F botulinum neurotoxin (BoNT/F) in man or animals and to a vaccine 
containing the fragment. 

Botulinum neurotoxins (BoNTs) are high molecular weight proteins (approx. 150,000 
Da) which exert potent neuroparalytic effects on vertebrates. They are elaborated by 
anaerobic Gram-positive bacteria belonging to the genus Clostridium. The majority 
of Clostridia which produce BoNT are classified as Clostridium botulinum. In recent 
years, however, isolates which resemble Clostridium barati and Clostridium butyricum 
have been shown to produce BoNT. On the basis of antigenicity, BoNT has been 
subdivided into seven distinct types, designated A to G. All seven neurotoxins 
(BoNT/A to BoNT/G) are synthesised as a single-chain 150,000 Da molecule which 
subsequently become nicked to the more potent di-chain form, composed of a heavy 
(H) chain (approx. 100,000 Da) and a light (L) chain (approx. 50,000 Da) linked by at. 
least one disulphide bridge. 

The action of BoNT involves three distinct phases. In the first phase the toxins 
become bound to acceptors on the external surface of the targeted neural cells. This 
is followed by an energy dependent internalisation step in which the toxin, or part of 
jt ( enters the cell. Thereafter, the active moiety of the toxin causes nerve cell 
dysfunction by blocking the intracellular release of the neurotransmitter, acetylcholine, 
at the nerve periphery, causing flaccid paralysis. The L chain possesses the catalytic 
activity responsible for cell poisoning and the H chain delivers this moiety to the cell 
cytoplasm by mediating binding of the toxin to the cell and subsequent internalisation. 
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The entire amino acid sequences of all 7 BoNTs are now known (Minton, N.P. (1995). 
Current Topics in Microbiology and Immunology 195: 161-187), revealing them to be 
surprisingly divergent in their primary amino acid sequences. Thus, sequence identity 
amongst the different serotypes generally does not exceed 40%, with those areas of 
homology localised to discrete domains which are interspersed with amino acid tracts 
exhibiting little overall similarity. Between the different L chains (average size 439), 
63 amino acids are absolutely conserved. Throughout the H chains (average size 
843) 97 amino acids are identical. The most notable areas of conservation include:- 
the two cysteine residues involved in the disulphide bond formation between the L and 
H chain; the histidine rich motif within the L chain associated with metalloprotease 
activity; and a highly conserved PYI/VXALN-motif found adjacent to regions identified 
as possessing membrane spanning potential. The most notable tract of sequence 
divergence amongst toxins is localised to the COOH-terminus of their respective H 
chains (amino acid 1 1 24 onwards of BoNT/A). This would appear to be consistent with 
the notion that this domain is involved in neurotoxin binding and that different toxins 
target different acceptors on neural cell surfaces. 

The effectiveness of modern food-preserving processes in Western countries has 
made outbreaks of botulism extremely rare. The frequent use of C.botulinum as a test 
organism in the food industry, and the growing use of the toxin by neurobiochemists, 
has, however, increased the need for human vaccines. The formulation of these 
vaccines has changed little since the early 1950s: partially purified preparations of the 
neurotoxins are toxoided by formaldehyde treatment and absorbed onto precipitated 
aluminium salts. Using such methodology, polyvalent vaccines (against ABCDE or 
ABEF) for human immunisation are currently available. Such vaccines suffer from the 
drawback of low immune response and considerable batch to batch variation due to 
the high proportion (60-90%) of contaminating proteins in toxoid preparations. Recent 
work has therefore concentrated on the development of procedures for the purification 
of toxins to near-homogeneity. The use of purified toxins in the production of 
vaccines, however, suffers from the drawbacks, first, of having to produce them under 
high containment and, secondly, of requiring the presence of low levels of 
formaldehyde to prevent possible reversion of the toxoid to the active state. 
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Production of subunit vaccines against other organisms and toxins has been 
investigated by a number of laboratories. This work has focused on the best known 
toxin subtypes, namely A and B, leading to new vaccines giving specific immunity 
against toxins of type A or B. Each new vaccine, however, may not give protection 
against other toxin subtypes. 

Recombinant production of vaccine components has brought great advances in 
vaccine purity and volume of production. A J. Makoff et al, in Bio/Technology, volume 
7, October 1989, pages 1043-1046, describe the expression of a tetanus toxin 
fragment in E.coli, and its purification and potential use as a vaccine. The technique 
described nevertheless requires a large number of steps to recover purified vaccine 
components from the host cells. 

It is an object of this invention to produce a vaccine against a type F botulinum toxin. 
It is another object to simplify vaccine manufacture. A further object is to improve 
production of botulinum toxin vaccines. A still further object of the invention is to 
overcome or at least mitigate problems and/or limitations in existing vaccines and 
methods of production. 

According to a first aspect of the invention there is provided a polypeptide free of 
botulinum toxin activity which induces protective immunity to a type F botulinum toxin. 
The polypeptide is useful in manufacture of a vaccine against type F toxin, and in 
contrast to prior art compositions such as polyvalent vaccines is not a toxoid and does 
not need pretreatment with formaldehyde. Also in contrast to prior art compositions 
the polypeptide is generally of smaller size than the toxin itself. 

An embodiment of the first aspect of the invention provides a polypeptide 
characterized in that it:- 

(a) is free of botulinum toxin activity, and 

(b) is capable of eliciting, in a mammal, an immunological response that is 
protective against type F botulinum toxin. 
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The term "protective" used in conjunction with "immunity" and "immunological 
response" is used to indicate an increased ability to survive challenge by active 
botulinum toxin F. This increase is typically mediated by an increased titre of 
antibodies to the toxin or an increased ability to produce antibodies to the toxin upon 
challenge with toxin. The term is not intended to indicate absolute protection against 
any amount of toxin. 

The invention thus offers specific protection against a type F botulinum toxin, 
protection that has hitherto been unavailable. 

In a particular embodiment the present invention provides a peptide or peptide 
conjugate comprising the amino acid sequence of the C. botulinum strain Langeland 
BoNT/F from amino acids 848 to 1278 (SEQ ID NO:1), but lacking the amino acid 
sequences of the L chain and H N epitopes necessary for metalloprotease activity and 
toxin internalisation (found between amino acids 1 to 439 and 440 to 847, 
respectively); the peptide is capable of inducing an immune response that is protective 
against BoNT/F when administered to humans or other animals. 

In a more particular embodiment the peptides of the invention consist of substantially 
only the sequence of amino acids from 848 to 1278 (SEQ ID NO:1) of the amino acid 
sequence of BoNT/F of the Clostridium botulinum strain Langeland, or of that 
sequence in the form of a fusion peptide with another amino acid sequence not being 
amino acids 1 to 847 of BoNT/F. The term x other amino acid sequence* will be 
understood by a person skilled in the art to include complete proteins as well as 
relatively short amino acid sequences as appropriate to the needs of the user. 
Optionally, the other amino acid sequence is a non-C. botulinum, antigenic protein 
which is included fused to the aforesaid sequence for the purpose of providing other 
immunity or labelling, or for modifying expression of the polypeptide in a host cell. 

In another embodiment of the invention the polypeptide comprises a fragment or a 
derivative of a type F botulinum neurotoxin free of botulinum toxin activity and capable 
of induce protective immunity against type F toxin. The fragment is free of toxoid and 
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free of formaldehyde and has a length of less than 700 amino acids, preferably less 
than 500 amino acids. 

In further specific embodiments of the invention the fragment is selected from:- 

(a) amino acids 848-1278 of a type F botulinum toxin, (SEQ ID NO:1) 

(b) amino acids 848-991 of a type F botulinum toxin, (SEQ ID NO:2) 

(c) amino acids 992-1 135 of a type F botulinum toxin, (SEQ ID NO:3) and 

(d) amino acids 1136-1278 of a type F botulinum toxin (SEQ ID NO:4). 

The invention also relates to a toxin derivative, being a synthetic polypeptide 
comprising a plurality of fragments of a type F botulinum toxin linked together in 
repeated sections. The derivative can comprise a dimer of the fragments specified 
above. 

The first aspect of the invention also provides polypeptide compositions, free of 
botulinum toxin activity and capable of inducing protective immunity against botulinum 
toxin, which compositions are adapted so as to facilitate their processing. This is of 
advantage in the manufacture of vaccines as polypeptide must be separated out from 
a mixture of any components that are undesirable in an eventual vaccine. Such an 
adapted composition comprises. 

(1) a polypeptide, free of botulinum toxin activity and capable of inducing 
protective immunity against a botulinum toxin; and 

(2) a polypeptide adapted for purification of the composition. 

Component (2) is adapted, for example, to facilitate purification of the composition 
from aqueous solution and optionally comprises an antibody, a binding region of an 
antibody, a polypeptide adapted to bind to an ion exchange column, a polypeptide 
adapted to bind to an affinity chromatography column or a purification ligand. 



The composition preferably comprises or consists of a single polypeptide including 
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components (1) and (2), for example in the form of a fusion polypeptide. 

In use of the compositions, extraction of the compositions from a mixture such as the 
supernatant from lysed cells expressing the composition is rendered a simple and fast 
process. It is particularly advantageous that in the composition, the vaccinating 
properties of component (1) are substantially retained, meaning that after purification 
of the composition it is used in a vaccine without the need for further modification, in 
particular without the need to remove component (2). As candidates for component 
(1) of the composition, all polypeptides previously described according to the first 
aspect of the invention are suitable. Further, fragments of tetanus toxin, free of toxin 
activity, are suitable. 

A polypeptide according to a specific embodiment of the invention thus comprises a 
fusion protein of:- 

(a) amino acids 848 to 1278 (SEQ ID NO:1) of a type F botulinum 
neurotoxin, with 

(b) a purification moiety. 

It is preferred that the purification moiety is adapted to bind to an affinity 
chromatography column. Atypical purification moiety comprises from 50 to 500 amino 
acids. In a specific embodiment the fusion protein comprises maltose-binding protein 
as the purification moiety. This fusion protein is particularly suitable for purification 
using an affinity chromatography column and has been found to have useful 
vaccinating properties, as described below. 

According to a second aspect the invention provides a vaccine against a botulinum 
toxin, comprising a polypeptide of the first aspect of the invention and a 
pharmaceutically acceptable carrier. 

Suitable carriers are known to a person of skill in the for preparation of the vaccine. 
In an embodiment hereinafter described the carrier includes Freund's adjuvant. 
Another suitable carrier component is precipitated alum salts. 
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In a third aspect of the present invention there is provided a recombinant DNA 
encoding polypeptides of the invention. Such recombinant DNA is conveniently 
provided by PCR amplification of the DNA coding for the desired sequence, eg. f 
BoNT/F 848-12 78. usin 9 primers targeted at respective ends of the double stranded 
sequence. Optionally the template sequence used in PCR represents the natural 
clostridial gene. In a preferred embodiment of the invention, however, the sequence 
used is a synthetic sequence encoding the same amino acids as the natural clostridial 
protein but in which codon usage has been altered. It is preferred that the synthetic 
gene has a GC content of at least 40%, preferably at least 45% and most preferably 
at least 50%. 

In the case of such a synthetic sequence, insertion into the chosen expression plasmid 
is achieved, in one embodiment of the invention, through the use of incorporated 
appropriate restriction endonuclease recognition sites positioned at the extremities of 
the DNA fragment during its construction. 

By whatever means the recombinant DNA encoding the BoNT/F peptide is generated, 
it is ligated into a suitable expression vector at which stage genetic fusion to a desired 
fusion peptide encoding sequence occurs, if desired, and the resultant vector is 
introduced into a suitable cell line, eg., E. coli or a yeast such as Pichia pastoris. A 
cell line producing the desired product is selected through established procedures, eg., 
Western Blotting, or ELISA. 

Fourth and fifth aspects of the invention provide respectively, a plasmid vector 
incorporating the DNA of the third aspect and a cell line comprising the plasmid and 
expressing the DNA. 

The invention also provides a method for production of a toxin vaccine in which 
purification of active vaccinating agent is facilitated by its expression in combination 
with a polypeptide sequence adapted for purification. Accordingly, a sixth aspect of 
the invention provides a method for production of a toxin vaccine, said vaccine 
comprising a vaccinating polypeptide free of toxin activity and capable of inducing 
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protective immunity against a toxin, wherein the method comprises expressing in a 
host cell a DNA sequence coding for a fusion protein, said fusion protein comprising 
said vaccinating polypeptide and a purification moiety, obtaining an extract from the 
host cell comprising the fusion protein, and purifying therefrom the fusion protein. 

In preferred embodiment of the sixth aspect of the invention there is provided a 
method of producing a vaccine containing a polypeptide of the first aspect of the 
invention, comprising the steps of:- 

(a) expressing in a host cell a DNA encoding a fusion protein, said protein 
being a fusion of (i) a fragment of a botulinum toxin, said fragment being 
free of toxin activity and capable of inducing protective immunity against 
botulinum toxin, and (ii) a purification moiety adapted to bind to an 
affinity chromatography column, 

(b) obtaining from said host cell an extract comprising the fusion protein, 
and 

(c) purifying the fusion protein using an affinity chromatography column. 

In use of an embodiment of the invention the fusion protein is removed from the 
column by elution with a substrate. The method optionally includes cleaving the fusion 
protein and retaining the toxin fragment. The method has been used specifically with 
type F toxin but applies also to all other botulinum toxins and to tetanus toxin. 

By this method the invention enables a preparation of botulinum toxin type F fragment 
free of contamination by other clostridial proteins, these latter frequently contaminating 
prior art preparations derived from cultures of Clostridium bacteria. 

The fusion protein or toxin fragment obtained is typically in a substantially pure form 
and suitable for incorporation into a vaccine or other pharmaceutical composition in 
a few simple steps. 

It should be noted that the creation of certain fusion proteins comprising the 
BoNT/F-derived peptide is useful in the initial isolation BoNT/F, following which 
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cleavage is optionally employed to purify the BoNT/F-related peptide. Where codons 
are added at the 5-end of the BoNT/F-encoding DNA to aid in translation, these 
amino acids are optionally retained at the NHg- terminal end of the final peptide, eg., 
those used to bring about secretion of the peptide or more simply the addition of an 
NH 2 -terminal methionine to initiate translation. 

A seventh aspect of the invention provides a method of making a pharmaceutical 
composition comprising:- 

(a) expressing in a host cell a DNA encoding a fusion protein, said protein 
being a fusion of (i) a botulinum toxin or a fragment thereof, free of toxin 
activity and capable of inducing protective immunity against botulinum 
toxin, and (ii)a purification moiety adapted to bind to an affinity 
chromatography column, 

(b) obtaining from said host cell an extract comprising the fusion protein, 

(c) purifying the fusion protein using an affinity chromatography column, 

(d) incorporating the purified fusion protein into a pharmaceutical 
composition. 

The purification moiety typically comprises 50 to 500 amino acids, is water soluble 
and binds to an affinity chromatography column. 

The inventors have found that a fusion protein retaining the purification moiety is of 
advantage in producing a vaccine against a type F botulinum toxin. Vaccinating 
activity is found in the fusion protein, so the purification protein does not need to be 
removed prior to vaccine manufacture, thus simplifying the manufacturing process. 
It is preferred that the purification protein is a globular, water soluble protein that binds 
to and can be purified using an affinity chromatography column. It is further preferred 
that the purification protein is highly immunogenic. Thus, a particularly preferred 
fusion protein comprises a fragment of a botulinum toxin free of toxin activity, an 
immunogenic region and a purification region adapted to bind to an affinity 
chromatography column. 
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The term immunogenic region is used above to describe a sequence of amino acids 
in a protein that is known to elicit stimulation of the immune system in humans or other 
animals. Examples of such an immunogenic region include keyhole limpet 
haemocyanin. 

Further aspects of the invention provide a pharmaceutical containing the fusion 
protein, methods of vaccinating mammals using the vaccines and compositions of the 
invention and antisera raised against the polypeptides, vaccines and compositions of 
the invention. 

There now follows description of specific embodiments of the invention, illustrated by 
drawings in which:- 

Figure 1 : shows the three major domains of a BoNT toxin. The numbers refer to 
the positions of the amino acids flanking these three domains in BoNT/F of 
C.botulinum strain Langeland; 

Figure 2: shows a schematic representation of how synthetic gene blocks were 
assembled by PCR; 

Figure 3: shows an example of a recombinant plasmid (pFHC206) made in which the 
synthetic DNA fragment in Figure 5 is inserted into the expression plasmid pMal-C2; 
and 

Figure 4: shows antibody titres against BoNT/F obtained in mice immunised with 
MBP-BoNT/F 848 _ 127 8 recombinant protein. 

SEQ ID NO:5 shows the nucleotide sequence of the region of the BoNT/F gene from 
Clostridium botulinum type F strain Langeland encoding the Hq fragment; 

SEQ ID NO:6 shows a synthetic DNA sequence encoding the BoNT/F Hq fragment 
which uses codons which are used most frequently in highly expressed genes of E. 
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coli. The codon corresponding to BoNT/F Serg 4 g begins at nucleotide position 12. 
It is proceeded by a codon specifying a NH 2 -terminal methionine codon and 
restriction sites for Nde\ and fiamHI. The codon for Asn 1278 begins at nucleotide 
position 1302, and is followed by a translational stop codon (nt 1305-1308) and a 
restriction site for Xba\\ 

EXAMPLES 

Generation of a synthetic DNA fragment encoding Hq of BoNT/F which makes use of 
codons utilised by highly expressed E. coli genes 

A synthetic sequence encoding BoNT/Fg 4 g_ 12 7g was designed by reverse translation 
of the BoNT/F amino acid sequence using the REVTRANS programme of DNASTAR 
Inc (Madison, USA). The codon code used was the "strongly expressed E. coli 
backtranslation code" (SECOLI.RTC). To facilitate the construction, a number of 
changes were then made to introduce restriction enzyme recognition sites at strategic 
points along the length of the fragment, including unique flanking proximal sites for 
SamHI and Nde\ a distal flanking site for Xba\ and internal sites for Hpal, Mlu\ and 
Sp/I. The gene was then constructed from overlapping 100 mer oligonucleotides by 
a procedure essentially as described elsewhere [Sandhu et al (1992) Biotechniques 
12:14-16]. 

Briefly, the gene was constructed as 4 individual blocks (A, B, C and D), each of 
approximately 350 bp in size. Each block was assembled from 4 x 100 mer 
alternating oligonucleotides which overlapped with each other by 20 nucleotides. 
These 4 oligonucleotides were used in a PCR to generate a composite c.350 bp 
double-stranded DNA fragment, which was subsequently amplified using 20 mer 
flanking primers. The amplified fragments of each block were then cloned directly into 
plasmid pCRII (Jnvitrogen Corp). The flanking primers of all 4 blocks were designed 
to include restriction enzyme sites which would allow their subsequent assembly into 
a contiguous fragment. Thus, block A was flanked by SamHI (5') and Hpal (3'), block 
B by Hpal (5') and MltA (3*), block C by MM (5') and Sp1L (3') f and block D by Sp1\ 



WO 96/41881 PCT/GB96/01409 

- 12- 

(5') and Xba\ (3'). Each block was, therefore, released from their respective pCRII- 
derived recombinant plasmid by cleavage with the appropriate enzyme and the 
isolated fragments ligated to pMTL23 [Chambers et al (1988). Gene 68:139-149] 
plasmid DNA which had been cleaved with BamHI and Xbal. A clone was then 
selected in which all 4 blocks had been inserted in the expected order. This was 
confirmed by nucleotide sequencing using routine methods [Maniatis et al. (1989). 
Molecular Cloning a Laboratory Manual. Cold Spring Harbor Laboratory Press], and 
the plasmid obtained designed pFHC23. 

Generation of a Hq peptide (848 to 1278) of BoNT/F of C. botulinum strain Lanqeland 

A candidate vaccine against the BoNT/F of C. botulinum was produced by expressing 
the fragment of the synthetic gene encoding the Hq fragment, amino acids 848 to 
1278. This DNA fragment was isolated from plasmid pFHC23 as an approximately 1 .3 
kb BarrH\-Xho\ restriction fragment and inserted between the unique SamHI and Sal\ 
sites of pUC9 [Vieira and Messing (1982). Gene 19: 259-268], generating the plasmid 
pFHC29. The insert was then reisolated from pFHC29 as an EcoR\-Xba\ fragment 
and inserted between the equivalent sites of the commercially available expression 
vector pMal-c2 (New England Biolabs), to yield the final plasmid pFHC206. The 
resultant plasmid expressed BoNT/F 848-1 2 7Q 33 a fusion P rotein with the vector 
encoded maltose binding protein (MBP). 

Fusion protein product (MBP-BoNT/F 848 _ 12 7 8 ) was prepared from the cell line 
containing pFHC206 in the following manner. E. coli containing pFHC206 was 
cultivated in 1 litre of media (M9, supplemented with 0.8M sorbitol, 0.5% casamino 
acids, 50 jug/ml ampicillin), shaking (200 rpm) at 37°C until an OD 60Q of 1.0 was 
achieved. At this point IPTG was added at a final concentration of 1 mM and shaking 
continued at 27°C for a further 4 hour. Cells were harvested by centrifugation (5000 
x g) and resuspended in 20 ml of lysis buffer (Protein Fusion and Purification System, 
New England Biolabs) and cells disrupted by sonication. Lysate was applied to a GPC 
column containing 180 ml of Sephacel S100, and the protein in the void fraction 
collected. MBP-BoNT/F H fl4 o 1P7fl fusion protein in this fraction was then allowed 
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to adsorb at room temperature to a 4-6 ml volume of Amylose resin (New England 
Laboratories) over a 3 hour period with gentle shaking (1 0 rpm). Recombinant fusion 
protein was then eluted in buffer (0.01 M Tris HCI ( pH 7.0) containing 5 mM maltose. 
Eluted protein was concentrated using an Amicon PM30 membrane filter. 

Toxicity of candidate vaccine 

The toxicity of the candidate vaccine fusion peptide was determined by intraperitoneal 
inoculation of 25 amounts of the total recombinant MBP-BoNT/F 848 _ 1278 protein 
into groups of 4 mice. The candidate vaccine was well tolerated and mice showed no 
signs of acute or chronic toxicity up to 2 weeks post inoculation. 

Antibody responses to candidate vaccines 

The candidate vaccine was administered to groups of 4 mice by intraperitoneal 
inoculation in complete Freund's adjuvant, and a booster inoculation given on 3 further 
occasions at two week intervals. Antibody response against purified C. botulinum 
strain Langeland BoNT/F was monitored by ELISA (Fig 4). 

Protection against toxin challenge 

Animals which were immunised with MBP-BoNT/F 848 _ 12 7 8 fusion protein were 
subjected to an intraperitoneal challenge with various doses of purified C. botulinum 
strain Langeland BoNT/F. At doses of 12 LD 5Q and above, all the control, 
unimmunised mice succumbed within 24 hour. All immunised groups of mice survived 
challenges of up to 2.4 X 10 4 LD 5Q . One of the immunised mice which had survived 
an initial challenge of 1.8, LD 50 was subsequently shown to be immune to a further 
challenge of 10 LD 5 q . 

TABLE 1 : Protection against challenge with C. botulinum strain Langeland BoNT/F 
afforded by the MBP-BoNT/F 848 _ 1278 fusion protein vaccine. A total of 4 X 25 fig 
intraperitoneal doses of antigen mixed with adjuvant were given to groups of 4 mice 
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at 14 day intervals. After 50 days mice were subjected to intraperitoneal challenges 
of varying levels of purified BoNT/F t (isolated from C. botulinum strain Langeland), and 
deaths recorded up to 4 days. 



Challenge Dose (LD 5Q ) 


Mortality/Total Animals 


Control Animals 


Immunised Animals 


2.4 X 10 4 


4/4 


0/4 


3.6 x 10 3 


4/4 


0/4 


5.4 X 10 2 


4/4 


0/4 


81 


4/4 


0/4 


12 


4/4 


0/4 


1.8 


2/4 


0/4 a 



a = one of the surviving individuals from this group was subsequently shown to be 
protected against a BoNT/F challenge equivalent to 10 6 LD 5 q. 



This invention provides a fragment (such as amino acids 848-1278) of BoNT/F 
isolated from C. botulinum strain Langeland for use as a vaccine. The fragment 
retains its immunogenic properties while still fused with MBP, dispensing with the need 
for an additional purification step. The recombinant fusion protein appears to be non- 
toxic in mice at doses up to 25;ig. Repeated doses produced a significant antibody 
response which protects animals against BoNT/F challenge. As a vaccine it offers 
several advantages over neurotoxin toxoided by formaldehyde treatment. Most 
notably, it may be prepared more easily and, due to the absence of active toxin, at a 
lower level of containment. The absence of other contaminating C.botulinum proteins 
and partially toxoided materials also make it inherently safer for vaccine application 
and less reactogenic. 
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SEQUENCE LISTING. 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 



(A 
(B 

(C 
(D 
(E 
(F 

(A 
(B 
(C 
(D 
(E 
(F 

(A 
(B 
(C 
(D 
(E 
(F 

(A 
(B 
(C 
(D 
(E 
(F 

(A 
(B 
(C 
(D 
(E 
(F 



NAME: Microbiological Research Authority 

STREET: Centre for Applied Microbiology and Research, 

Porton Down 
CITY: Salisbury 
STATE: Wilshire 
COUNTRY: UK 

POSTAL CODE (ZIP): SP4 OJG 

NAME: Nigel Peter Minton 
STREET: 27 Moberly Road 
CITY: Salisbury 
STATE: Wiltshire 
COUNTRY: UK 

POSTAL CODE (ZIP): SP1 3BZ 

NAME: Michael J Elmore 

STREET: 8 St Mary's Court, Eastrop Lane 

CITY: Basingstoke 

STATE: Hants 

COUNTRY: UK 

POSTAL CODE (ZIP): RG21 4AT 



NAME: Margaret Lamble Mauchline 
STREET: Three Dormers, Bakers Hill, 
CITY: Shaftesbury 
STATE: Dorset 
COUNTRY: UK 

POSTAL CODE (ZIP) : SP7 9BQ 

NAME: Vladimir Artymovich Pasechnik 
STREET: 1 Copper Beech Close 
CITY: Shrewton 
STATE: Wiltshire 
COUNTRY: UK 

POSTAL CODE (ZIP) : SP4 4HU 



Semley 



(ii) TITLE OF INVENTION: BOTULINUM TOXIN VACCINE AND ITS MANUFACTURE 
(iii) NUMBER OF SEQUENCES: 6 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, 



Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Ser Tyr Thr Asn Asp Lys lie Leu lie Leu Tyr Phe Asn Lys Leu Tyr 
15 10 15 

Lys Lys lie Lys Asp Asn Ser He Leu Asp Met Arg Tyr Glu Asn Asn 
20 25 30 

Lys Phe He Asp He Ser Gly Tyr Gly Ser Asn He Ser He Asn Gly 
35 40 45 

Asp Val Tyr He Tyr Ser Thr Asn Arg Asn Gin Phe Gly He Tyr Ser 
50 55 60 

Ser Lys Pro Ser Glu Val Asn He Ala Gin Asn Asn Asp He He Tyr 
65 70 75 80 

Asn Gly Arg Tyr Gin Asn Phe Ser He Ser Phe Trp Val Arg He Pro 
85 90 95 

Lys Tyr Phe Asn Lys Val Asn Leu Asn Asn Glu Tyr Thr He He Asp 
100 105 HO 

Cys He Arg Asn Asn Asn Ser Gly Trp Lys He Ser Leu Asn Tyr Asn 
115 120 125 

Lys He He Trp Thr Leu Gin Asp Thr Ala Gly Asn Asn Gin Lys Leu 
130 135 1^0 

Val Phe Asn Tyr Thr Gin Met He Ser He Ser Asp Tyr He Asn Lys 
145 150 155 160 

Trp He Phe Val Thr He Thr Asn Asn Arg Leu Gly Asn Ser Arg He 
165 170 175 

Tyr He Asn Gly Asn Leu He Asp Glu Lys Ser He Ser Asn Leu Gly 
180 185 190 

Asp He His Val Ser Asp Asn He Leu Phe Lys He Val Gly Cys Asn 
195 200 205 

Asp Thr Arg Tyr Val Gly He Arg Tyr Phe Lys Val Phe Asp Thr Glu 
210 215 220 
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Leu Gly Lys Thr Glu lie Glu Thr Leu Tyr Ser Asp Glu Pro Asp Pro 
225 230 235 240 

Ser lie Leu Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asn Lys Arg 
245 250 255 

Tyr Tyr Leu Leu Asn Leu Leu Arg Thr Asp Lys Ser lie Thr Gin Asn 
260 265 270 

Ser Asn Phe Leu Asn He Asn Gin Gin Arg Gly Val Tyr Gin Lys Pro 
275 280 285 

Asn He Phe Ser Asn Thr Arg Leu Tyr Thr Gly Val Glu Val He He 
290 295 300 

Arg Lys Asn Gly Ser Thr Asp He Ser Asn Thr Asp Asn Phe Val Arg 
305 310 315 320 

Lys Asn Asp Leu Ala Tyr He Asn Val Val Asp Arg Asp Val Glu Tyr 
325 330 335 

Arg Leu Tyr Ala Asp He Ser He Ala Lys Pro Glu Lys He He Lys 
340 3^5 350 

Leu He Arg Thr Ser Asn Ser Asn Asn Ser Leu Gly Gin He He Val 
355 360 365 

Met Asp Ser He Gly Asn Asn Cys Thr Met Asn Phe Gin Asn Asn Asn 
370 375 380 

Gly Gly Asn He Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val Ala 
385 390 395 400 

Ser Ser Trp Tyr Tyr Asn Asn He Arg Lys Asn Thr Ser Ser Asn Gly 
405 410 415 ' 

Cys Phe Trp Ser Phe He Ser Lys Glu His Gly Trp Gin Glu Asn 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Ser Tyr Thr Asn Asp Lys He Leu He Leu Tyr Phe Asn Lys Leu Tyr 
15 10 15 
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Lys Lys lie Lys Asp Asn Ser lie Leu Asp Met Arg Tyr Glu Asn Asn 
20 25 30 

Lys Phe He Asp He Ser Gly Tyr Gly Ser Asn He Ser He Asn Gly 
35 40 45 

Asp Val Tyr He Tyr Ser Thr Asn Arg Asn Gin Phe Gly He Tyr Ser 
50 55 60 

Ser Lys Pro Ser Glu Val Asn He Ala Gin Asn Asn Asp He He Tyr 
65 70 75 80 

Asn Gly Arg Tyr Gin Asn Phe Ser He Ser Phe Trp Val Arg He Pro 
85 90 95 

Lys Tyr Phe Asn Lys Val Asn Leu Asn Asn Glu Tyr Thr He He Asp 
100 105 HO 

Cys He Arg Asn Asn Asn Ser Gly Trp Lys He Ser Leu Asn Tyr Asn 
115 120 125 

Lys He He Trp Thr Leu Gin Asp Thr Ala Gly Asn Asn Gin Lys Leu 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

.(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Val Phe Asn Tyr Thr Gin Met He Ser He Ser Asp Tyr He Asn Lys 
15 10 15 

Trp He Phe Val Thr He Thr Asn Asn Arg Leu Gly Asn Ser Arg He 
20 25 30 

Tyr He Asn Gly Asn Leu He Asp Glu Lys Ser He Ser Asn Leu Gly 
35 40 45 

Asp He His Val Ser Asp Asn He Leu Phe Lys He Val Gly Cys Asn 
50 55 60 

Asp Thr Arg Tyr Val Gly He Arg Tyr Phe Lys Val Phe Asp Thr Glu 
65 70 75 80 



WO 96/41881 PCT/GB96/01409 

- 19 - 

Leu Gly Lys Thr Glu lie Glu Thr Leu Tyr Ser Asp Glu Pro Asp Pro 
85 90 95 

Ser lie Leu Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asn Lys Arg 
100 105 HO 

Tyr Tyr Leu Leu Asn Leu Leu Arg Thr Asp Lys Ser lie Thr Gin Asn 
115 120 125 

Ser Asn Phe Leu Asn lie Asn Gin Gin Arg Gly Val Tyr Gin Lys Pro 
130 135 1^0 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asn He Phe Ser Asn Thr Arg Leu Tyr Thr Gly Val Glu Val He He 
15 10 15 

Arg Lys Asn Gly Ser Thr Asp He Ser Asn Thr Asp Asn Phe Val Arg 
20 25 30 

Lys Asn Asp Leu Ala Tyr He Asn Val Val Asp Arg Asp Val Glu Tyr 
35 40 45 

Arg Leu Tyr Ala Asp He Ser He Ala Lys Pro Glu Lys He He Lys 
50 55 60 

Leu He Arg Thr Ser Asn Ser Asn Asn Ser Leu Gly Gin He He Val 
65 70 75 80 

Met Asp Ser He Gly Asn Asn Cys Thr Met Asn Phe Gin Asn Asn Asn 
85 90 95 

Gly Gly Asn He Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val Ala 
100 105 no 

Ser Ser Trp Tyr Tyr Asn Asn He Arg Lys Asn Thr Ser Ser Asn Gly 
115 120 125 

Cys Phe Trp Ser Phe He Ser Lys Glu His Gly Trp Gin Glu Asn 
130 135 140 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TCATATACTA ATGATAAAAT TCTAATTTTA TATTTTAATA AATTATATAA AAAAATTAAA 60 

GATAACTCTA TTTTAGATAT GCGATATGAA AATAATAAAT TTATAGATAT CTCTGGATAT 120 

GGTTCAAATA TAAGCATTAA TGGAGATGTA TATATTTATT CAACAAATAG AAATCAATTT 180 

GGAATATATA GTAGTAAGCC TAGTGAAGTT AATATAGCTC AAAATAATGA TATTATATAC 240 

AATGGTAGAT ATCAAAATTT TAGTATTAGT TTCTGGGTAA GGATTCCTAA ATACTTCAAT 300 

AAAGTGAATC TTAATAATGA ATATACTATA ATAGATTGTA TAAGGAATAA TAATTCAGGA 360 

TGGAAAATAT CACTTAATTA TAATAAAATA ATTTGGACTT TACAAGATAC TGCTGGAAAT 420 

AATCAAAAAC TAGTTTTTAA TTATACACAA ATGATTAGTA TATCTGATTA TATAAATAAA 480 

TGGATTTTTG TAACTATTAC TAATAATAGA TTAGGCAATT CTAGAATTTA CATCAATGGA 5^0 

AATTTAATAG ATGAAAAATC AATTTCGAAT TTAGGTGATA TTCATGTTAG TGATAATATA 600 

TTATTTAAAA TTGTTGGTTG TAATGATACA AGATATGTTG GTATAAGATA TTTTAAAGTT 660 

TTTGATACGG AATTAGGTAA AACAGAAATT GAGACTTTAT ATAGTGATGA GCCAGATCCA 720 

AGTATCTTAA AAGACTTTTG GGGAAATTAT TTGTTATATA ATAAAAGATA TTATTTATTG 780 

AATTTACTAA GAACAGATAA GTCTATTACT CAGAATTCAA ACTTTCTAAA TATTAATCAA 840 

CAAAGAGGTG TTTATCAGAA ACCAAATATT TTTTCCAACA CTAGATTATA TACAGGAGTA 900 

GAAGTTATTA TAAGAAAAAA TGGATCTACA GATATATCTA ATACAGATAA TTTTGTTAGA 960 

AAAAATGATC TGGCATATAT TAATGTAGTA GATCGTGATG TAGAATATCG GCTATATGCT 1020 

GATATATCAA TTGCAAAACC AGAGAAAATA ATAAAATTAA TAAGAACATC TAATTCAAAC 1080 

AATAGCTTAG GTCAAATTAT AGTTATGGAT* TCAATAGGAA ATAATTGCAC AATGAATTTT 1140 

CAAAACAATA ATGGGGGCAA TATAGGATTA CTAGGTTTTC ATTCAAATAA TTTGGTTGCT 1200 
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AGTAGTTGGT ATTATAACAA TATACGAAAA AATACTAGCA GTAATGGATG CTTTTGGAGT 1260 
TTTATTTCTA AAGAGCATGG ATGGCAAGAA AAC 1293 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GGATCCATAT GTCTTACACT AACGACAAAA TCCTGATCCT GTACTTCAAC AAACTGTACA 60 

AAAAAATCAA AGACAACTCT ATCCTGGACA TGCGTTACGA AAACAACAAA TTCATCGACA 120 

TCTCTGGCTA TGGTTCTAAC ATCTCTATCA ACGGTGACGT CTACATCTAC TCTACTAACC 180 

GCAACCAGTT CGGTATCTAC TCTTCTAAAC CGTCTGAAGT AAACATCGCT CAGAACAACG 240 

ACATCATCTA CAACGGTCGT TACCAGAACT TCTCTATCTC TTTCTGGGTT CGTATCCCGA 300 

AATACTTCAA CAAAGTTAAC CTGAACAACG AATACACTAT CATCGACTGC ATCCGTAACA 36O 

ACAACTCTGG TTGGAAAATC TCTCTGAACT ACAACAAAAT CATCTGGACT CTGCAGGACA 420 

CTGCTGGTAA CAACCAGAAA CTGGTTTTCA ACTACACTCA GATGATCTCT ATCTCTGACT 480 

ACATTAATAA ATGGATCTTC GTTACTATCA CTAACAACCG TCTGGGTAAC TCTCGTATCT 5^0 

ACATCAACGG TAACCTGATC GATGAAAAAT CTATCTCTAA CCTGGGTGAC ATCCACGTTT 600 

CTGACAACAT CCTGTTCAAA ATCGTTGGTT GCAACGACAC GCGTTACGTT GGTATCCGTT 660 

ACTTCAAAGT TTTCGACACT GAACTGGGTA AAACTGAAAT CGAAACTCTG TACTCTGACG 720 

AACCGGACCC GTCTATCCTG AAAGACTTCT GGGGTAACTA CCTGCTGTAC AACAAACGTT 78O 

ACTACCTGCT GAACCTGCTC CGGACTGACA AATCTATCAC TCAGAACTCT AACTTCCTGA 840 

ACATCAACCA GCAGCGTGGT GTTTATCAGA AACCTAATAT CTTCTCTAAC ACTCGTCTGT 900 

ACACTGGTGT TGAAGTTATC ATCCGTAAAA ACGGTTCTAC TGACATCTCT AACACTGACA 960 

ACTTCGTACG TAAAAACGAC CTGGCTTACA* TCAACGTTGT TGACCGTGAC GTTGAATACC 1020 

GTCTGTACGC TGACATCTCT ATCGCTAAAC CGGAAAAAAT CATCAAACTG ATCCGTACTT 1080 
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CTAACTCTAA CAACTCTCTG GGTCAGATCA TCGTTATGGA CTCGATCGGT AACAACTGCA 1140 
CTATGAACTT CCAGAACAAC AACGGTGGTA ACATCGGTCT GCTGGGTTTC CACTCTAACA 1200 
ACCTGGTTGC TTCTTCTTGG TACTACAACA ACATCCGTAA AAACACTTCT TCTAACGGTT 1260 
GCTTCTGGTC TTTCATCTCT AAAGAACACG GTTGGCAGGA AAACTAATCT AGA 1313 
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CLAIMS 

1 . A polypeptide free of botulinum toxin activity which induces protective immunity 
to a type F botulinum toxin. 

2. A polypeptide characterized in that it- 
fa) is free of botulinum toxin activity, and 

(b) is capable of eliciting, in a mammal, an immunological response that is 
protective against type F botulinum toxin. 

3. A polypeptide according to Claim 1 or 2 comprising a fragment or a derivative 
of a heavy chain of a type F botulinum neurotoxin. 

4. A polypeptide according to Claim 3 wherein said fragment or said derivative is 
up to 600 amino acids long. 

5. A polypeptide according to Claims 3 or 4 wherein said fragment is selected 
from:- 

(a) amino acids 848-1278 of a type F botulinum toxin, 

(b) amino acids 848-991 of a type F botulinum toxin, 

(c) amino acids 992-1 135 of a type F botulinum toxin, and 

(d) amino acids 1136-1278 of a type F botulinum toxin. 

6. A polypeptide according to Claims 3 or 4 wherein said derivative comprises a 
dimer of the fragment according to any of (a)-(d) of Claim 5. 

7. A polypeptide composition for use in manufacture of a vaccine, said 
composition comprising:- 



(1 ) a polypeptide free of toxin activity and capable of inducing, in a mammal, 
protective immunity against a botulinum toxin or a tetanus toxin; and 
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(2) a polypeptide adapted to facilitate or .enhance purification of the 
composition. 

8. A polypeptide composition according to Claim 7 wherein the composition 
comprises a fusion protein. 

9. A polypeptide composition according to Claim 7 or 8 comprising:- 

(1) a polypeptide according to any of Claims 1-6; and 

(2) a polypeptide adapted to bind to a chromatography column. 

10. A polypeptide composition according to any of Claims 7-9 comprising a 
polypeptide adapted to bind to an affinity chromatography column. 

11. A polypeptide according to Claim 8 comprising a fusion protein of:- 

(a) amino acids 848 to 1278 of a type F botulinum neurotoxin, with 

(b) a purification moiety. 



12. A vaccine comprising a pharmaceutical^ acceptable carrier and a polypeptide 
according to any of Claims 1-6 or a polypeptide composition according to any 
of Claims 7-11. 

1 3. A recombinant DNA encoding a polypeptide according to any of Claims 1 -6 or 
a polypeptide composition according to any of Claims 7-11. 

14. A method of producing a polypeptide according to any of Claims 1-6 or a 
polypeptide composition according to any of Claims 7-1 1 comprising the steps 
of:- 
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(a) expressing in a host cell a DNA encoding a fusion protein, said protein 
being a fusion of (i) a fragment or derivative of a type F botulinum toxin, 
and (ii) a moiety adapted to bind to an affinity chromatography column, 

(b) obtaining from said host cell an extract comprising the fusion protein, 
and 

(c) purifying the fusion protein using an affinity chromatography column. 

15. A method according to Claim 14 wherein the fusion protein is removed from the 
column by elution with a substrate. 

16. A method according to Claim 14 or 15 further comprising cleaving the fusion 
protein and retaining the toxin fragment. 

17. A method of making a pharmaceutical composition comprising:- 

(a) expressing in a host cell a DNA encoding a fusion protein, said protein 
being a fusion of (i) a polypeptide free of toxin activity and capable of 
inducing protective immunity against a botulinum toxin or a tetanus toxin, 
and (ii) a purification moiety adapted to bind to a chromatography 
column, 

(b) obtaining from said host cell an extract comprising the fusion protein, 

(c) purifying the fusion protein using chromatography column, 

(d) incorporating the purified fusion protein into a pharmaceutical 
composition. 

18. A method according to Claim 17 wherein said purification moiety binds to an 
affinity chromatography column. 

19. A pharmaceutical composition comprising:- 

(a) a fusion protein, said protein being a fusion of (i) a polypeptide fee of 
toxin activity and capable of inducing protective immunity against a 
botulinum toxin or a tetanus toxin, and (ii) a polypeptide adapted to bind 
to a chromatography column; and 
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(b) a pharmaceutical^ acceptable carrier. 

20. A pharmaceutical composition according to Claim 19 wherein said fusion 
protein comprises a polypeptide according to any of Claims 1-6. 

21 . A pharmaceutical composition according to Claim 19 or 20 wherein the fusion 
protein comprises a polypeptide adapted to bind to an affinity chromatography 
column. 

22. A method of vaccinating a mammal against a botulinum toxin, comprising 
administering to said mammal a vaccine according to Claim 12. 

23. A method of vaccinating a mammal against a botulinum toxin, comprising 
administering to said mammal a pharmaceutical composition according to any 
of Claims 19-21. 



WO 96/41881 PCT/GB96/01409 



ML 




SUBSTITUTE SHEET (RULE 26) 



WO 96/41881 



PCT/GB96/01409 



2/U 



FIRST AMPLIFICATION. FIRST ROUND 
5' 3 1 5' 3 1 

X ^ 

(//?////////////?& 

3' 5 1 3 1 5' 



FIRST AMPLIFICATION.SECOND ROUND 



3' 5' 



V 



+FLANKING PRIMERS 
[P1 + P2] 



SECOND AMPLIFICATION PI 

<=■ 

v ;//////// A///;////;;;;;;;m^ \\\\\ iss^\\\\\^\\^ 

FIG. 2 



SUBSTITUTE SHEET (RULE 26) 



WO 96/41881 



PCT/GB96/01409 




pMAL Tm -c2.-p2: POLYINKER 

t Xmnl EcoRI BamHI Xbal Sail Pstl Hind III 

malE..ATC GAG GGA AGG ATT TCA6AATTC GGATCCTCT AGAGTC GAC CTG CAG GCAAGCTTG.JacZor 
ile glu glyargiile <> 



V / FACTOR Xa 
/CLEAVAGE SITE 



/ BoNT/F-H r 



EcoRI Hpal MM Spl 




Xba 

r 



0.2 01 0.6 0.8 



10 



1.2 1.3-kb 



FIG. 3 



SUBSTITUTE SHEET (RULE 26) 



WO 96/41881 



PCT/GB96/01409 




SUBSTITUTE SHEET (RULE 26) 



INTERNATIONAL SEARCH REPORT 



Intr ional Application No 

PCT/GB 96/01409 



A. CLASSIFICATION OF SUBJECT MATTER _ . 

IPC 6 C12N15/31 C12N15/62 C07K14/33 A61K39/08 




According to International Patent Classification (IPC) or to both national classification and IPC 




B. FIELDS SEARCHED 


Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C07K 


Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 


Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 


C. DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


DATABASE MEDLINE 

FILE SERVER STN KARLSRUHE 

ABSTRACT 77064466, 

HATHEWAY: "TOXOID OF CLOSTRIDIUM BOTULINUM 
TYPE F: PURIFICATION AND IMMUNOGENICS 
STUDIES" 
XP002015940 

& APPLIED AND ENVIROMENTAL MICROBIOLOGY, 
(1976 FEB) 31 (2) 234-42. 
see abstract 


1,2,12, 
22 


X 


ABSTRACTS OF THE 95TH GENERAL MEETING OF 
THE AMERICAN SOCIETY FOR MICROBIOLOGY, 
21 - 25 May 1995, WASHINGTON D.C.USA, 
page 289 XPO02O15937 
MONTGOMERY ET AL: "EVALUATION OF 
BOTULINUM TYPE F VACCINE BY ELISA" 
see abstract E-49 

-/- 


1,2,12, 
22 


j x| Further documents are listed in the continuation of box C. 


X Patent family members are listed in annex. 


* Special categories of cited documents : 

* A" document defining the general state of the art which is not 

considered to be of particular relevance 
*E' earlier document but published on or after the international 
filing date 

"L* document which may throw doubts on priority daim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

'O' document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 


T* later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

'X" document of particular relevance; the d aimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

'Y* document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art 

'&* document member of the same patent family 


Date of the actual completion of the international search 

16 October 1996 


Date of mailing of the international search report 


Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL * 2280 HV Rijswijk 
Tel. ( + 31-70) 340-2040, Tx. 31 651 epo nl. 
Fax: ( + 31-70) 340-3016 


Authorized officer 

Sitch, W 



Form PCT/lSAaiO (second sheet) (July 1992) 



page 1 of 3 



INTERNATIONAL SEARCH REPORT 



tntf onal Application No 

PCT/GB 96/01409 



^Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category * 


Citation of document, with indication, where appropriate, or the relevant passages 


Relevant to claim No. 


X 


BIOCHEMISTRY, 
vol. 33, 1994, 

Daaes 7014-7020. XP002015938 
LI ET AL: "A SINGLE MUTATION IN THE 
RECOMBINANT LIGHT CHAIN OF TETANUS TOXIN 
ABOLISHES ITS PROTEOLYTIC ACTIVITY AND 
REMOVES THE TOXICITY SEEN AFTER 
RECONSTITUTE WITH NATIVE HEAVY CHAIN" 
see the whole document 


7,8,10, 

12,13, 

17-19,21 


X 


W0.A.94 03615 (MEDEVA HOLDINGS B V ; KHAN 
MOHAMMED ANJAM (GB); HORMAECHE CARLOS ES) 
17 February 1994 

see page 10, paragraph 2 - page 11, 
paragraph 2 


7,8,10, 

12,13, 

17-19,21 


X 


DATABASE CHEMICAL ABSTRACTS 

FILE SERVER STN KARLSRUHE 

ABSTRACT 123:219673, 

MINTON: "PHYSICAL CHARACTERIZATION OF 

CLOSTRIDIUM BOTULINUM NEUROTOXIN GENES" 

XP002015941 

& REPORT (1993), ORDER N0.AD-A272939 ... 
see abstract 


7,8,10, 
12,13, 
17-19, 
21-23 


A 


DATABASE CHEMICAL ABSTRACTS 
FILE SERVER STN KARLSRUHE 
ABSTRACT 119:21962 

MINTON: "PHYSICAL CHARACTERIZATION OF 
CLOSTRIDIUM BOTULINUM NEUROTOXIN GENES" 
XP0O2015942 

& REPORT (1992) .ORDER NO.AD-A248904. . . 
see abstract 


1-23 


A 


SYSTEM. APPL. MICROBIOL., 

vol. 18, no. 1, May 1995, 

Daaes 23-31 XPO00605363 

ELMORE ET AL: "NUCLEOTIDE SEQUENCE OF THE 

GENE CODING FOR PROTEOLYTIC (GROUP I) 

CLOSTRIDIUM BOTULINUM TYPE F NEUROTOXIN: 

GENEALOGICAL COMPARISON WITH OTHER 

CLOSTRIDIAL NEUROTOXINS" 

see the whole document 


1-23 


A 


WO, A, 94 21684 (PHLSB ;NIBSC (GB); SESARDIC 

DOROTHEA (GB); CHAN WOON LING (GB); SH) 29 

September 1994 

see page 12; table 1 

see page 17, last paragraph -page 18, last 

paragraph 

see page 20; table 6 

-/-- 


1-23 



Form PCT ISA/210 <cortlinu*tion of second sheet) (July 1993) 



page 2 of 3 





INTERNATIONAL SEARCH REPORT 


Int ional Application No 

PCT/GB 96/01409 


^Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category' 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


A 


TRENDS IN BIOTECHNOLOGY, 

vol. 8, no. 4, April 1990, 

pages 88-93, XPOO0103U0 

SASSENFELD: "ENGINEERING PROTEINS FOR 

PURIFICATION" 

see the whole document 




7,8,10, 
12,13, 
17-19, 
21-23 


P,X 


BIOCHEMISTRY, 

vol. 34, November 1995, 

pages 15175-15181, XP002015939 

ZHOU ET AL: "EXPRESSION AND PURIFICATION 

OF THE LIGHT CHAIN OF BOTULINUM NEUROTOXIN 

A: A SINGLE MUTATION ABOLISHES ITS 

CLEAVAGE OF SNAP-25 AND NEUROTOXICITY 

AFTER RECONSTITUTE WITH THE HEAVY CHAIN" 

see the whole document 




7,8,10, 
12,13, 
17-19, 
21-23 



Form PCT/1SA/2I0 (continuation of tecond sheet) (July 1993) 



page 3 of 3 



INTERNATIONAL SEARCH REPORT 



* "enuttonal application No. 

PCT/GB 96/01409 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. [X] Claims Nos.: 22,23 

because they relate to subject matter not required to be searched by this Authority, namely: 

Remark: Although claims 22 and 23 are directed to a method of treatment 
of the human/animal body, the search has been carried out and based on 
the alleged effects of the compound/composition. 



□ 



Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



□ 



Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1. | | As all required additional search fees were timely paid by the applicant, this International Search Report covers all 



searchable claims. 



2. | | As all searchable claims could be searches without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. | | As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
covers only those claims for which fees were paid, specifically claims Nos.: 



4. | | No required additional search fees were timely paid by the applicant Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest j | The additional search fees were accompanied by the applicant's protest 

j j No protest accompanied the payment of additional search fees. 



Form PCT/ISA/710 (continuation of first sheet (1)) (July 1992) 



IXJ'I'L'UKf ATfHMA f ClTADfU D tTTJODf* 
IIYI xLKIN A 1 IUIN AL otLAKCll K£JrUK 1 

hibernation on patent family mcmben 


In uonal Application No 

PCT/GB 96/01409 


Patent document 
cited in search report 


Publication 
date 


Patent family 
member(s) 


Publication 
date 



WO-A-9403615 



17-02-94 



AU-A- 
CA-A- 
EP-A- 
FI-A- 
JP-T- 
NO-A- 



4719393 
2141427 
0652962 

950396 
8503602 

950348 



03-03-94 
17-02-94 
17-05-95 
30-01-95 
23-04-96 
28-03-95 



W0-A-9421684 29-09-94 AU-A- 6432594 11-10-94 

CA-A- 2158748 29-09-94 
EP-A- 0690875 10-01-96 



Form PCT1SA. 210 (palent family uiimi) (July 1WJ) 



rbssible Dissociation of the Heparin-binding and Mitogenic 
Activities of Heparin-binding (Acidic Fibroblast) Growth Factor-1 
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Abstract. The fibroblast or heparin-binding growth 
factors (HBGFs) are thought to be modulators of cell 
growth and migration, angiogenesis, wound repair, 
neurite extension, and mesoderm induction. A better 
understanding of the structural basts for the different 
activities of these proteins should facilitate the devel- 
opment of agonists and antagonists of specific HBGF 
activities and identification of the signal transduction 
pathways involved in the mechanisms of action of 
these growth factors. Chemical modification studies of 
Harper and Lobb (Harper, J. W.. and R. R. Lobb. 
1988. Biochemistry. 27:671-673) implicated lysine 132 
in HBGF-I (acidic fibroblast growth factor) as being 
important to the heparin-binding, receptor-binding, 
and mitogenic activities of the protein. We changed 
lysine 132 to a glutamic acid residue by site-directed 
mutagenesis of the human cDNA and expressed the 
mutant protein in Escherichia coti to obtain sufficient 
quantities for functional studies. Replacement of this 
lysine with glutamic acid reduces the apparent affinity 



of HBGF-I for unncbili/ed heparin (ehues at (145 M 
NaCI vs. 1.1 M Nad for wild-type). Mitogenic assays 
established two pouts: (a) human rrrrwrtNrunt HBGF4 
is highly dependent on the presence of heparin for op* 
timal mitogenic activity, and (6) the change of lysine 
132 to glutamic acid drastically reduces the specific 
mitogenic activity of HBGF-I. The poor a u tog enic ac- 
tivity of the mutant protein does not appear to be due 
to a reduced affinity for Che HBGF receptor. Simflarty, 
the mutant H3GF4 can stimulate tyrosine kinase ac- 
tivity and induce protooncogene expression. Differ- 
ences in rhe btotogkal properties of die wild-type and 
mutant proteins were observed in tranifcctaon studies. 
Mutant HBGF-I expression in transfected NDi 3T3 
cells did not induce the same transformed phenotype 
characteristic of cells expressing wild-type HBGF-1. 
Together these data indicate that different functional 
properties of HBGF-1 may be dissociated at die struc- 
tural level. 



The heparin-binding growth factor (HBGF) 1 family 
presently consists of seven structurally related poly- 
peptides (3). The cDNAs for each have been cloned 
and sequenced, two of the proteins, HBGF-I and HBGF-2. 
have been characterized under many dif fe r en t names, but 
most often as acidic and basic fibroblast growth factor, 
respectively. Three sequence-reUted oncogenes have been 
identified: the hst oncogene was discovered based on its abil- 
ity to transform KIH 3T3 cells (9, 25. 38, 45); the int-2 on- 
cogene wis first identified as a gene activated by mouse 
marrmary tumor virus (7, t(X 1 1) and the FGF-5 oncogene 
was identified using NTH 3T3 rjiitsforrnabon assays (46. 47). 
Recently a gene termed FGF-6 was identified by screening 
a mouse cosmid library with a human hst probe under re- 
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duced stringency and was shown to be capable of transform- 
ing NTH 3T3 cells (32). Finally, an epithelial ceU-specific 
growth factor termed KGF or FGF-7 has been identified and 
its cDNA cloned and sequence d 03). 

Functions associated with HBGF-1 and HBGF-2 include 
stimulation of nutogenests, cbcmotaxrs, mesoderm indue* 
bon, ocunte extension, and ptasmmogen activator activity. 
These HBGFs also induce angiogenesis in vivo and acceler- 
ate wound repair (for reviews see it s tituces 3. 1& 27. 36). 
The mechanisms by which HBGFs promct these fonctiow 
are poorly understood but may include activation of protein 
tyrosine kinase activity & 15, 20), phosphorylation of phos- 
pholipase C*y (6), and activation of trarnediate-earty gene 
transcription (17). tn addition, both HBGF-1 and HBGF-2 
hive been shown to be relatively resistant to de g rad ati on af- 
ter internalization by receptor-mediated endocytosis (14, 24. 



O The *ockr.:tkr U(U**rut> fret*. O02I-992S/W1 1-IIIVIO 13.00 
The JottiiMi of Ctll Boio^y . Volume 111, NcwontK* W0 2l2*-2lM 



2129 



34). lntav. g.-*th factor pei _ i-*cellularly for seven! 
noun and targe fragments (15D00 and 10000 Af, for HBGF-1; 
I6JD00 hi, for HBGF-2) arc detectable after as many as 24 h. 
Frnher. nuclear or nucleolar localization of HBGF-2 has 
been observed (2, 35). 

Despite the identification of additional members of the 
HBCF family and a broad range of cells and tissues that con- 
tain the growth <actors, and despite the availability of large 
quantities of recombinant protein and increased knowledge 
of the broad spectrum of activities of potential biological 
significance that can be attributed to the HBGFs, relatively 
little is known regarding the relationship of these highly con* 
served structures to any of their known functions. Baird et 
al. 0) reported the synthesis of 25 peptides, which together 
encompass and overlap the entire sequence of HBGF-2 a« de- 
scribed by Ueno et al. (42). They reported the identifica- 
tion of two functional domains in the primary structure of 
HBGF-2 based on the abilities of synthetic peptides to inter- 
act with HBOF receptor, bind radiolabeled heparin in a solid 
phase assay, and inhibit HBGF-2 stiiriulatioaofthynuVline in- 
corporation into DNA. Using the numbering system of the 
authors (which does not corre sp on d to full length HBGF-2) 
statistically significant functional activities could be assigned 
to peptides corresponding to residues 24-68 and 106-1 15 of 
HBGF-2. Similarly. Schubert et al. 09) demonstrated that 
peptides corre s ponding to residues 1-24, 24-68, and 93-120 
of HBGF-2 are able to stimulate substratum adhesion of 
PCI2 cells. We have shown that a synthetic peptide corre- 
sponding to residues 49-72 of HBGR (using numbering 
of 1-154 for full length HBGF-1) is able to compete with 
HBGF-1 for heparin binding in a ^overlay assay (33). This 
region is hornoiogous to one of the regions of HBGF-2 (res- 
idues 24-68) described above as possessing heparin-binding 
activity. 

lb date, the most complete and infor mati ve studies docu- 
menting the effects of chemical modification of any HBGF 
on function are those of Harper and Lobb 09). Briefly, they 
were able to show that lunited reductive methytation of bo- 
vine HBGR with forrnakkhyde and cyajnoorohydride re- 
sulted in stoichiometric methylation only of lysine 132 (us- 
ing W54 numbering for full length HBGF-1). They reported 
90% rmdificatkm of this residue, with 60% dirnethytysine. 
The modified protein exhibited significantly reduced appar- 
ent affinity for uruTiobilized heparin (ehited at M17 M Nad 
vs. M.2 M NaC! forunrnodified HBGF-1). a fourfold reduc- 
tion in its ability to stimulate DNA synthesis in NTH 3T3 
fibroblasts and a similar reduction in its ability to compete 
with labeled ligand in a ra di orecep t or assay. A lysine residue 
is found at this position of HBGF4 and HBGF-2 of all spe- 
cies characterized to date, Together these data implicate a 
crucial role for lysine 132 in several of the known functions 
of HBGF-1. 

In this report we address the rote of lysine 132 in HBGF-1 
function using site-directed mutagenesis of this position to 
a glutamic acid This approach offers several advantages 
over chemical modification studies including (a) the ability 
to produce large quantities of the desired product, {b) elimi- 
nation of significant (although sub^toicruornetric) modifica- 
tion of other lysines, and (c) allowing the introduction of 
modified HBGF-1 into mammalian cells through transection 
of cDNA expression vectors designed to produce the desired 
mutant. Despite these advantages the importance of chemi- 



cal modification stud ; es such as those of Harper and Lobb 
(19) should not be underestimated for they are extremely use- 
ful in the design of a rational approach to site-directed muta- 
genesis. The irsults described here demonstrate that replace- 
ment of lysine .' — ^ with glutamic acid reduces 
significantly its apparent aifimty "or immobilized heparin 
and its mitogenic capacity. However, the apparent affinity of 
the mutant for high affimty cell suroue receptors appears un- 
altered. When assayed in the pr ese nc e of heparin where the 
din%reocemwOd-rypeandrmrtajH 
try is most apparent, mutant HBGF-1 on stimulate tyrosine 
kinase activity and induce p i utuo o c ogenc esyressfon. Func- 
tional differences between the wild-type and mutant HBGF-1 
are also apparent atotrajtsnxtiOT 
tors into NIH TH ^ ""Mists. 

Materials and Methods 
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of 10 cnM TnVMCl. pH 7A. 3 bM EDTA. 30 bM ffacoec. A I 
of he* cat >y«eyme ■ *e mm totter wm added » 10 rtM-Tkcrib 
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Sephtrwe tt <*C *iih end<ower-end mixing for 2 h The reiin «u etuted 
hj.ch*.v yi.-g ■ imirml glass funnel and successive * _V unt 

huffcr containing 0, 0 I. 0 5. 063. and 1.3 M NtCI 

TV *itd type recombinant HBGF-I c luted with the 15 M NaCI «uh. 
TV m UIJ nt »n cluted the 0.5 M NjC »tuh. Although the »ikl-fype 
prrMf.n wn essentially pun iftcr hcpann-Srpharott chromaiofrtpny, the 
mutant HBGF I conitifuted only 10-20% of the G J M N«Cl wash. Both 
prrparanoni <**rr punned to >95% punry using reverted phase HPLC(4>. 
The revcrvd -phase punned matenaJ wa< used for all reported studies. 

Characterization of Recombinant Proteins 

All preparations of punned tcc u i nb i m m human wild -type and mutant 
HBGF I wen: inalyod by SpS-PAGE. amino tod analysis, amino terminal 
seqvenemg. pept.de mapping, tnd amino acid sequencing of the peptide en- 
compassing the mutated residue. Protein conccntrebom were deter mused 
by ammo acid analysis. Aliquot) of wild-type and mutant HBGF-I were ■*> 
jected to electfophoresh nng the SOS-PACE system of Larmmii Ot» A 
15% acryiamide. 04% W-mtfhykr*bi*acryUmidt solum «u pofy- 
menttd tn a Hoefer mini-gel apptnmn and eleclr<w4inreui was earned out 
at t constant 200 V Pro* cm was visualised by flaming the get with 01% 
Coomawie blue R 250 in 50% metfunoi. (0% glacial acetc ac«« and de- 
taining with 9% glacial acetic acid. 3% methanol Samples for ammo acid 
analyst were hydrolyxed with argoo-punjed. constant baling 6 N HO at 
II5T for 18 h gsing a fVo-T«g wot fcj iation (waters Associates, MUford. 
MA). Ammo acidi were drrivatued with phetiylaothjocyanaac and tep* 
wed *.th a PTC eftajyerr (model IJOA; Apol«d Bioiyssrms, Inc.). A 
Witen 140 sy«em was used far data collection and reduction Amino and 
vquentrt were ateNishrd tning a protein sequencer (model 477A; A> 
pl.ed B««yuemv Inc.) using modified Edman cherrustry and an on-line 
model UUA PTH analyzer Pepnoe mapping of recombinant proatia a*tr 
digation with endoprotesnese Asp»N at • 1:23 moo of enzyme lo protein 
.n 50 mM Na,HPa. pH IQ> J7*C far II h una perfarmed using a nucn> 
bore HPLC system (model U0A; Applied BioayMtt. Inc.*. The appropn- 
*ie peptides were subjected to amino acid tvqurnce analysis to ntaMnh (he 
fidelity of eioress.oo of the wild -type and mutant HBCF-I vectors. 

Stability Studies 

Mrt*boJ.cl!v labeled recombinant proteins were prepared by growing bac- 
terial cuttunrt as described above antil die A, w reached MM at 
point the cells were collected by centhfngntioa. They were fcsnaprajj 
98 5% M9 minimal mediom/1.3% Luna broth and I'HIknrme 
C./mmo<| wasaddedio45 aCi/mJ. Ccfh were grown with slating i 
mm. and then far an additional 4 h m the proenee of I mMtaopro 
gaiactoiidc. Celli were collect* and growth facion panned as 
ibo*c TV punned labeled growth bciors were tncubaled far 4* h _ 
inuVpre^eimofrmdufDMEMcomainmg 10% caJf mm) that hod 
cond.t»ned for 4gh by NINTH cellv The growth factor ^containing 
was analyzed by SOS-PAGE and autoradp g ia p hy 

Mitogenic Assays 

TV imogenk activities of wild-type and mutant rt c oiifrn ant H8CF-I 
wm determined by measunng their ability to stimulate DMA ryothesu in 
NIH in cells and to support the proliferate of human umbilknl ma en- 
dotheltal edit. DNA tynthem was determined by iriensnring the amount 
of I H frrrmidine in corp o i ai e d mto oefls. Bnefty. NTH TTJ cefh were 
seeded into 4S-wdl plates and grow to near coni l utmc m DME oontainmg 
10% calf serum. The cells were serum starved (DME, 0l3% calf serum) 
far 24 h Mttogens were added to (he wetlt and im ntnaul far IS h The 
cells were pulsed with 05 ,Ci/ml of pHTAyinidme OS Ci/ntmoO far 4 b 
The cells were rinsed with PBS, 6 sod with 10% TCA. rinsed with PBS, and 
then sotubiliml with 0.5 N NaOH. Incorporahon of I'HfttryWtw mto 
«od . rtsolubic m aterial was detenmned by sen* illation counting AH assays 
were perfarmed in triplicate. 

Human umbilical *in eftdothetia! cells were pnwided by T. Mariag 
(Ajnerican Red Cross, Rocrville, MOr. They were maintained on 6bn> 
nemtwr^ed plates (2 *tg/cnr*> r« medium 199 lupp l ui i n n ul with 10% 
(*>INol> heatHnactrnttcd FBS. Ik tnbbwtx.asmmycraic. 10 U/ml hepana 
and 10 ng/ml human reconibin uHl HBCF-l. For growth assays, cefts w«- 
seeded tn 24-well plates at 2000 cells/well medium 199 snpp kii i t tn uJ 
as above with the esception of HBQrM. T>« tndkaced amouistt of wildVypt 
or mutant HBCF-I and Sepana were added to the wellv The media was 
chanted em other day. After 7dm culture, cells were trypsmued and 
counted using a hemocytorneter 
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Competition for Binding and Cros^Unking to CeC 

Surjace fUcept^rs 

Bovine braio^dcrived HBCF-I (4) una atoesed whn ts I i— — ^ Tisni 
lacioperosjdase and NofogscaOy adrw, laoesed erowia snm hntaamj ananl 
hrptfui ' ^escribed (161. Conrhjeni NTH TTJ oHH fa TlT*5 

pUtownnrana-i j r >4 » befart btadlag espr^tsnenai hi DM* con- 
taining 03 % caJf scrum. The cclH were aihiiil and hvnbntnd whn 0MB ^ 
containing 5 U/mJ hepana. Q5% BSA, and 23 mM Hepea. pN U (bandang ^ 
burTer) at room temperature far JC aha. IViOxtetten wmhn^anand wtt 

HBCF-I and oniabcied wifcMypsovnnntaMHBQF4 hhtf 
3 UAnl hepana aa usdjeated hi tint ftgnra leaaaaV Tha c 
oa ice far 90 mm. The pUaea were m y i r v*" i and want 
biiidingbuffcr. The cdb were then hvntanod far » unn « 4T *#* 1 * , 
^ni-^^ nrTri Titrrahl hi nn Thtnnailaatii nnaahinnii 



rattdotTatuiihemactioa |. n l i l bynddhnj20M IWKXpHtftTW 
cdh were washed wah PftS. «nganj fni pinna nni acAW ftv 10 a 
at 134)00 g. The prlkn were craajcnnl with 100 ad of 30 asM Irhv | wM 
FTJTA W> s x t. io% Trhon X-IOQ. Ql atM i^n|tnnihjh>allnnl 
' * la. pH 7 5 far 20 ami at 4«C The ciirnc* ttn centfrnaad far 10 

wjra« of Uemmli nnnptebnnC te*SBS^MOB ajnltnhx "* 

Stimulation of Protein Tyrosint Klnasf Aethity 



NTH TD orOs \ 

^ 2« kjunnsJ above. The c 

IOnjrmJufwik^pepr»«utnniKK» 

washed once with coW PBS ton rysed fa befc ansahsne M nu^ Trk. 
30 mM NaO, 3 asM EDTA, 30 amM NaF, 30 anM aodMa ayfognnn^hnhx 
lOOnM nsdinmorihovnnidjtav IO% TrhonX 100.1. 
fanjrl mtonde, pH 7.4. The c 



ItH or 



on ice far 10 nam. tjottn «m aerated by c 

at 10000 g far 10 ana m 4*C and nm a 

her) were nabhxsed to PACE hi hw piniim of SP1 The | 
tonrovxxOviownntf hnanenohhyjaa^ 

<15>. The hfanj snst hvnhaani whV^anaiA 




t3r90a M at4^TWbenda,,,a ■eahnl ahh 20nuMHepe»,0l% Trhon 
X-lOa )50asJ4NaO. 10%grn^.pH7Alnnnn»annn^hnh^ 
*ere ehned from the bonis wnh 2x Uananeli nnngne ennVr and a 
to PACE and westrra btotbng win nen>nl 



RNAGHBhtAnetysis 



far 44 h in OMEAX3B fCS and men chi 
*ype or ii ton HBGF4 far t 




, total SNA one prepared 07). and 
ilJ%anaroacgeh 

: fanmkszhyde. The gets < 
ph u t ugj a p l i o J to serif* mnt each fane t 
orfraded nlinafine UNA. RNA snm « 
fthen and croavanhed by ITV trrt»hntiun. The t 
and sonrcc of the DNA probes were aa faOowr («) e^bt. 2Mb Noo V 
Xho t fragment of pr^aH; Anserican Type Chhwe Gobotboav Ihxftsslle, 
MT>. (b) c^wt, IJ^Hhniriimnjtnmfiwgnnofph^ 
gd. Unr*roy of California, La JoOn,CA.(c)c^usc. L4-fcbS«I I 
ofpHSR4: ATCC.(«n| 




Ptt (OCba I rrngtnent of pHcGAP; AlCC The probes were UbeJed wnh 
("f^JCTP (X0O0 CiAntnoO aaini 



dnk XAJtS Afan at -70*C 



e capoaed to Ko» 



Trxmsfrction of NIH JT3 Cells with HBGF4 
Eukaryotic Expression Ptasmtds 

NTH 3TJ ccfh in 100 mm <hshes were unsnJbvtpJ nidi plasand DNA by 
OVcainumprKn^uneprecspunam 

either I ^g of pSV2 noo (41) or co-trnnshxted wnh a mixture (1:10 ng) of 
pSV2 aeo and earner KBGF-t winVtype espreaam wxtor (pW or HBCF-I 
muuat expressjon sector (p2bS) The picsmid p267 is described aahwect 
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Figure /. Stimulation of DMA synthesis in NIH 3T3 cHM V *iii-rype and mutant HBGF-1. Cells were |otb to near co nflue n t* tad 
serum starved for 24 h as described m Materials tnd MeinodV Cells were rated with the faafceated cakceatrarjoas of wild-type (•) or 
mutant (a) HBGF-1. incubated for 18 h. and then poised with 05 jrfTi of pH|*ynridioe/lnl for 4 h. The ceJh were harvested and incorpo- 
ntxn of radioactivity was determined. Bod) wild-type and mount HBGF-1 were assayed in the presence of 0 (Ah 5 (J), or 50 U/ml 
heparin (C). 



al.a3);p26lwai ai a ati w > lB d pyrepl«ciaf eVW^r^uToajnnwpmw 
ofp267 (enoodbai ante acio 39^55) wim 

(he ptr&sv^K t iptt hjob pimrwJ pQ 32 asiNg owowM wrftcSMnf wtA- 
ods. Celti w« split 10 10 dishes aid trvatari coIova m wJcctod fey 
teMagteedhaOMG. 10% calf sctwa coaiaaaa f SW ja m dOnwfr 
cut. The media wat chanted cwjfy d. After 4 wtu tnwafecaal odIdwjcs 
wen? analysed for HBOF4 cxpretsioa) by Weswrv Wet anatyM want, rabbit 
prfyctonaJ HBGFI-specife ambothes and '^-prcaaa A as wacrlwl 



Results 

V 

Heparin-bindlng Properties ofHBGF4 Mutant pU2E 

A drastic reduction in the apparent affinity of HBGF4 coo 
taintng glutamic acid in place of lysine at position 132 was 
observed during the purification of the recombinant proteins 
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HBGF-1 (no/mf) 

figure 2. Ability of wild-type and mutant HBGF-1 to stimulate 
growth of Woman umbilical vein endothelial cetis. Cdls were 
seeded and cultured as d es cribed ta Materials and Methods. Cell 
number after 7 d in culture in (he presence of the iudiu at d concen- 
tntions of wild-type (o/a) or mutant (a/ a) HBGF-1 in the ab- 
sence (O/A) or presence (a/a) of 50 U/ml heparin is shown. 



from the Eseheriehkk coH r/safles. Reconibmant wild-type 
HBGF-1 from E, coB tysatcs can be purified I near 
homogeneity wtth a smgte heyarin-Scphaiusc step. The pro- 
tern binds the uwiiobilnscd bepantt during tiltniive washing 
with 05 and OAS M NaO-comaming boners and behrred 
with a single step of 1 J M NaO-<OQtaimng buffer. In con- 




could not be used as a 1 
HBGF-1. The I 

the presence of 01 M NaO b«»asciutrjdomiivtbea5 M 
NaO wash Bom v ril w^ 

M NaO equates, l e sp e Ui w dy ) could be pwified to apparent 
homogen ei ty using it*jstd-phast HPLC Detailed analysis 
of the *rr* wwl * *^* > '^^ *f two purified proteins far tro- 
mobUtzed htpar us-Sepharoac using relatively shallow, linear 
NaO gradients radicated that the mutant HBGF-1 elated 
with a45 M Nad whereas wild-type teamed 1.1 M NaO 
to be ehded (data not shown). For aO of die assays described 
below we used reverses-phase HPLC purified wild-type or 
mutant HBGF-1. Ptotcui concentrations were detenczsd »y 
airtino acid analysis of preparations that bad been shown to 
be the d es i r ed HBGF4 form by peptid e (tupping and asnino 
acid seq u enc e analysis (data not shown). 

Mitogenk Properties afHBGFi Mutant piS2E 

The ability of the HBGF4 mount to «-» mitogenesis 
was compared to mat of the wild-type protem using two 
different assays. In the first, the ability of the two proteins 
to stimulate DNA syn the sis m NIH 313 edb as m easur e d 
by pHJmymidme inc o r p oiati oo was riarimrri The assays 
were conducted over a broad range of HBGF-1 and heparin 
LutiLejitiatioos. Two hrtportant points can be made from the 
data in Fig. 1. One* the wild-type HBGF4 has a dramatic 
requirement for die presence of heparin fee optimal mit<v 
genie activity and, two, die mutant HBGF4 n sigmhcantly 
less potent than wild-type protein m die presence of added 
heparin. As can be teen in Fig. 1, Che maximal difference in 
mitogetuc potency was observed tn the presence of 5 U/ml 
heparin (~30-fokf). link Difference (apptnsinunery t hr t e - 
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CLU.w HBCF I 


16 16 


13 1.2 


17 


1.4 


Wild type H8GF 1 


I : 2 0 


1.9 29 


12.6 


16.6 



fold) between the wild -type and mutant protein is seen in the 
absence of added heparin because of the relative lack of 
mitogenic activity of wild-r/pc human recombinant HBGF-I 
in the absence of heparin. The possibility that the reduced 
mitogenic activity of the mutant HBGF-I is related directly 
to its reduced apparent affinity for immobilized heparin is 
supported by the observation that the difference in the mito- 
genic potency between the wild-type and mutant protein is 
reduced to M 8-fold in the presence of 50 U/ml heparin. 

In the second mitogenesis assay the abilities of the wild- 
type and mutant proteins to support the proliferation of hu- 
man umbilicaJ vein endothelial cells were compared. The 
results shown in Fig. 2 are consistent with those described 
above in that they demonstrate a dramatic heparin require- 
ment of the wild-type HBGF-I for biological activity and that 
the mutant HBGF-I is not able to support cell proliferation 
to the same extent as the wild-type protein. These experi- 
ments were conducted in the pr e s enc e of 50 U/mJ heparin 
and the endotheliaJ cells were seeded in the presence of 10 
ng/mJ wild-type HBGF-I. When growth assays were con- 
ducted in the presence of 5 U/ml heparin without wild-type 
protein during the seeding, mitogenic deficiencies of the mu- 
tant protein were more pronounced (Table 0- The results 
shown in Fig. 3 demonstrate that the reduced mitogenic ac- 
tivity of the mutant HBGF-I does not appear to be the result 
of any increased susceptibility of the protein to proteolytic 
digestion by components in serum or the conditioned media 
of NTH 3T3 cells. 

Receptor-binding Activity of HBGF-1 Mutant pU2E 
The results presented above are consistent with the observa- 



figure I Analysis of the relative stability 
of wild -Type and mutant HBGF-1 in NTH 
3T3 cell -conditioned media. The wild- 
type and mutant proteins were labeled 
and punned as described in Materials and 
Methods. The proteins were incubated in 
the presenc t of NTH 3T3 cdl-condrtioned 
media for 48 hat 37*C and then subjected 
to SDS-WJGE The gets were dried and 
labeled proteins visualized fry autoradiog- 
raphy. Lane 1 contains wild-Type HBGF-1 
and lane 2 mutant HBGF-1. The apparent 
molecular weights of both proteins are 
identical to that of HBGF-l before incu- 
bation. 



tions of Harper and Lobb DO) us in* bovine br*ia~'terr*ed 
HBGF-I selectively methylated al lysine 132, although the 
magnitude of the reduction in mriogeoic potency (^30-foki 
for 3T3 cell assay) as compared with the ~4-fo4d ^n.^ 
reported by — tn*1 Lobb (19) is significantly greater. 
They also reported retiu. <c*-binding activity for the 
modified protein. We examined the abilities of the wild-type 
and mutant recombinant HBGF-I t compete with -la- 
be led bovine HBGF-I for binding to ceil surface re c ep t o rs 
on NTH 3T3 cells at a concentration of added heparin (3 U/ 
ml) where the difference in mitogenic po ten ci es of the two 
proteins was greatest. 

The receptor-binding activity of the mutant HBGF-I was 
established by competition for cross-Inking of ^-HBGF-l 
to 150000- and 130000-Af, proteins present on the surface 
of NTT T3 ccUs (16). The results shown m Fig. 4 demon- 
strate tnat tf-.c mutant HBGF-I is similar to wikMype protein 
in its ability to compete for rcceptor-ligand crass-linking. 

The fimctkmal consequences ofHBGF4 buidiof tottsceil 
surface receptor include stimulation of protein tyrosine ki- 
nase activity (K, IS* 20) inrmrtmg phosphorylation of phos- 
phoSp<c C y (6). Fig. 5 A demon s tr at es that both wOttype 
and mutant HBGF-t are able to increase the phosphotyrosxae 
content of I50U0OO-. 9GJD00-. and 70000-44 proteins tod, 
to a lesser extent, proteins with lower relative nsnlecular 
masses as nidged by Western blot analysis with phospbo- 
tyTOStne-specific antibodies. The dose response and rrtrnf of 
activation is similar for the two forms of the growth factor. 
StimuJaftiOQof the phosprwryTos^ 

v^-^y was eraiiiiiKJU oy aoo-pnogyuuiyiuaaa; wewcmoioiafap* 

bodies that recognize phosphoupnse C^. Fig. 5 at demoo- 
strates thr mutant HBGF-I shares with wflcMype HBGF-1 
the abtiity to ttmiu U nY tyrosine 

lipase Oy. These data regarding stmwurtioo of tyrosine ki- 
nase activity by wild-type and mutant HBGF-1 are m good 
agreement with the receptor-binding data described above 
but do not provide insight into the functional basis for the 
relatively poor mitogenic capacity of this HBGF-I i 



Protooncogene Induction by WW7)pr and 
Mutant HBGF4 

The results described above indicate that the functional prop- 
erties of the mutant HBGF-l associated with events that oc- 
cur at the oeU surface (i.e., receptor -binding and tyrosine ki- 
nase activation) are normal with respect to those of wild-type 
HBGF-I. In addition to tyrosine kinase activation, another 
carry re s oo o s e to HBGF-1 receptor-btnding is the elevation 
of protooncogene mRNA levels (IT), lb determine the effect 
of wild-type and mutant HBGF-1 on protooncogene expres- 
sion, NTH 3T3 cells were serum starved and then either left 
unstimulated or stimulated with 10 ng/ml wild-type or mu- 
tant HBGF-t. Heparin (5 Uftnl) was also added to the cefts 
receiving growth factor Cells were coOccted at various 
times after stimulation, RNA was prepared, and levels of 
c-Jhi. c-jw%. c-mve. and ghceraidehyoje ^-phosphate dehydro- 
genase mRNA fas a control for the amount of RNA loaded 
in each lane) were assayed by RNA gci Mot analysis. Wild- 
type and mutant HBGF-l increased protooncogene mRNA 
levels to a similar degree; maximal levels were observed at 
30 min ic-fos, c-jvn) or 2 h (enwye) after stimulation (Fig. 
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Figure 4. Ability of wild-type and mutanl HBGFI to compete with 
'"Habeled bovine HBGFI for cross-linking to I5O000- and 
I30j000-mol wt cdl suriaee receptors. N1H TO cells were incu- 
bated with I ng/ml bovine '"l-HBGFl tnd either 0 5. I a 5d 
10(X or 500 ng/ml of wild-type (Unci 1-5) or mutant (lanes 6-10) 
human recombinant HBGF-I in the presence of 5 U/mJ heparin. Af- 
ter incubation, the cells were treated with cross-linking reagents as 
described in Materials and Methods. The apparent mc4milir weights 
of cross-linked species were de te rm i ne d alter SDS-f*GE and auto- 
radiography. The positions of two cross-linked 150000- and 
1 30jOOO-mol wt species, which correspond to the known apparent 
molecular weights of HBGF receptors, are indicated with arrows. 



6). The addition of heparin alone did not induce protoon- 
cogene enpression. Since the autogenic differences between 
the wild-type and mutant HBGF-t are more pronounced at 
tower growth factor concentrations, we also stimulated cells 
with (15, l.(X 5.0. anu IC *' *vi w -type tnd mutant growth 
factor (again in the presence of ht^-c .). 4 . all four concen- 
trations used, (he wild-type and mutant HBGF-1 were simi- 
lar in their ability to induce c-fos mRNA expression (Fig. 7). 

Cherrxprrssion of VH &f *7>pf and Mutant HBGF-1 in 
Thmsftxted NIHJT3 Cells 

It wis dciiHJt m i il ed rjreviousiy that overexpression of wild- 
rype HBGF- 1 in transfected Swiss 3T3 cells resulted in cells 
with an elongated, transfixmed riorphologkaJ rjhenotype 
that grew to hip'x. ^duration densities (23). This trans- 
formed phenorype occurred even though the HBGF-I poly- 
peptide wis not detectable in the coodiboaed media of these 
celts. We have shown that the mutant HBGF-t b not a potent 
mitogen although it can bind receptor and initiate early 
events associated with autogenic signal transctoctkm. To in- 
vestigate whcArr if e intracellular function of the mutant 
HBGFI was altered, we examined the ability of this protein 
to induce a transformed rjhewotype in NTH 3T3 cells. Cells 
were either transfected with a pits mid conferring neomycin 
resistance or co-craflsfceted with the neomycin resistance 
pUsrnid and wild-type or mutant HBGF-I expression vectors. 

Fig. 8 shows the results of Western blot analysts of trans- 
fected celt lysates using HBGFI-specific anybodies. The 
Western blot analysis was normalized to cdl twmber and 
provides the basis for our obsignation of relatively high or 
low levels of HBGFI expression. The results shown in Fig. ' 
9 ckmonstrate that cells expressing a high level of wild-type 
HBGFI (Fig. 9 B) and to some extent a low level of wild- 
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Figure JL Stimulation of ptoteui ty rosroe ki- 
oasc activTty by wild-type and mutant 
HBGF-I. M ) Senan starved NIH JT3 ceOs 
were cither (lane / ) unw i n a iU aed or treated 
with 5 U/ml hepario and (lane 2) I ng/ml 
wild-type; (bine S) 10 qe/ral wild-type; 
(lane 4) I ng/ml mutant: or (lane 5) 10 
ng/ml mutant HBGFI. The cells were 
processed as described aa Materials and 
Methods and phcspbotyiQsinc*<coauiing 



tyrosine antibodies and ^t^roteui A. The 
arrows indicate the positions of 13Q0OO-, 
90000-. and 7rj£00-4nol wt proteins whose 
phosphoryrosine coutent are increased by 
the addition of wuaMype or mutant HBGFI. 
(B) CeOs were mrubaaed m m A with the 
cijcepuoo that cell lysates were unnsBnoprc- 
cipftaced with ara>*pcpipholayase C*tt anti- 
bodies before Western blot analysis with an- 
D-phospbrcyrostne antibodies. Cells were 
either Cane / ) unuimulattd or treated with 
(lane 2) 10 ng/ml wild-type, or (lane J) 10 
ng/ml mutant HBGF4. The arrow shows the 
position of a I SOOOO-rnol wt protein whose 
phosphotyrosine co n t en t is increased by 
treatment wim wild-type or mutant HBGFI. 
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Ftgurr 6. Effect of wild-type and mutant HBGF-I on protoon c ogen e 
mRNA levtli. Serum tt«r*ed NIH 3T3 cell* were either left un- 
stimulated or treated with heparin (5 U/ml) and 10 nt/ml wild -type 
(*r ) or mutant (ml) HBGF-I for the indicated time periods. RNA 
wu prepared and used for RNA gel blot hybridization using the ra- 
diolabeled ONA probes indicated on the left side (GAPDH. glycer- 
aldehyde 3 -phosphate dehydrogenase). The upper and lower tick 
marks on the left side of each panel np i ticnt the positions of 28 
and ISS rRNA. respectively 



stoichiometric modification of lysine residue 1 32 (using the 
1-154 numbering system for full -length HBGF-I). It wo sug- 
gested that modification of this residue, which is co rne red 
in all HBGF-I and sequences reported to date, was 

responsible for the reduce©. % .nity fo r immobilized 

heparin, the reduced autogenic capacity, *ad the reduced 
receptor-binding activiry of the modi6ed protein. The results 
presented here using site^directed mutagenesis to address 
the role of lysine 132 on the functional propqiies of HBGF-I 
are in general agreement with the conclusions of Harper and 
Lobb (19). Specifically, substitution of lysine 132 for glu- 
tamic acid reduces the apparent affinity of the reranbinant 
protein for immobilized heparin (etutes at (MS M NaO com- 
pared with 1.1 M NaO for wild-type) and significantly re- 
duces the mitogen* porrnc .\ the growth bet or. The re- 
duced mitogemc per: vy may be a direct consequence of the 
reduced apparent affinity of the mutant HBGF-! fix heparin 
since it has been dernonstrated that the class 1 heparin- 
binding growth (actors in general (29) and human HBGF-l 
in particular (22. 43) are dependent on the presence of hepa- 
rin for optimal btokwical activity. 

Our results do oot njpport me notion that the reduced 
autogenic capacity of HBGF-l cootaioing glutamic acid hi 
place of lysine at position 132 is due to reduced binding to 
cell surface receptors. The reccptor^binding properties of 
the mutant HBGF-l are not distinguishable from those 
wild -type protein as judged by cross-bilking experiments 
(see Fig. 4). In addition, the mutant HBGF-l is able to in- 
duce the same pattern of tyrosine kinase phosphorylation as 
is the wild-type protein (see Fig. 5) and eta induce p»oto- 
oncogene expression (see fig. ©"). The majority of the studies 
presented here utilize a heparin coocentrafioo of 5 U/ml; the 
concentration where maximal difference between the tnito- 
genic activity of wild-type and rmtant HBGF-l w« observed 
in the 3T3 cell thyrnidine incorporation assay. It should be 
noted that in the absence of heparin, the mutant HBGF-l 
competes poorly with Ubeled wild-type HBGF-l m cross- 



type HBGF-l (Fig. 9 D) have acquired a more polar, elon- 
gated phenotype characteristic of transformed 3T3 cells. 
This phenorype is not seen in cells expressing neomycin re- 
sistance alone (Fig. 9 A) or in cells expressing relatively high 
levels of mutant HBGF-l (Fig. 9 C). It should he noted that 
we have not been able to detect HBGF-l uTununorenctivity 
in the media conditicned by these cells and that the cells ex- 
pressing relatively high levels of wild-Type HBGF-l show en- 
hanced growth in soft agar relative to unttinsfocted cells or 
cells expressing high levels of the mutant HBGF-l (dau not 
shown). These results are consistent with the results of the 
mitogenic assays described above which demonstrate that 
the growth-promoting activity of the mutant HBGF-I is rela- 
tively low when compared to the wild-type protein. 



Discussion 

The experiments described in this report were initiated as a 
result of the chemical modification studies of HBGF-l 
reported by Harper and Lobb (19). They demonstrated that 
reductive methylation of HBGF-I resulted in selective. 
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Flgwi 7 Effect cf diflLitni concentra- 
tions of wad-type and mutant HBGF4 
on c^bf mRNA le«cls> Serum starved 
NIH JT3 cdh were either left on- 
sbnRuated or treated with heparin (5 
USm!) and (A) 0.5 of/ml. (0) 1J0 ng/ 
ml. (O 5X) nt/ml. (D) tO at/ml wfid- 
rype or motM (nr> HBGF-I far 
the indicated tune periods. RNA was 
prepared and ued for RNA gel blot 
hybridization asing the c-£m DNA 
probe («pr*r panris) cr gryceralde- 
hyde 3-pho^hne deh^faoaenaae ONA 
probe (Itw pantO). 
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figure 8. WeMern blot analysis of 
HBGF-I in NIH 3T3 cJIs transect- 
ed with wild-type or mutant KBGF-1 
expression plasmids. NIH JT3 cells 
*rrc transfecied as dovrnbed in 
Materials and Methods. The figure 
shows the relative levek of HBGF-I 
icimunorcaclivity pment in tyutes 
of cells transfecied with wiW-type 
HBGF-I (lane /. clone producing 
relatively high level of HBGF-I. 
line 3, clone producing relatively 
low level of HBGF-I) normal NIH 
JT3 cells (lane 2), cells transfecied 
with pSV2nco alone (lane 4). and 
cells transfecied with mutant HBGF-I (lane 5). For each cell type. 
10* cells were lyved wnh I ml of 2x Laemmh sample buffer and 
a SO-nl aliquot «*ai used in the Western blot. 



linking assays (dan not shown). In addition, whereas the ap- 
parent affinity of the mutant HBGF-1 for immobilized hepa- 
rin is reduced, it does bind at ionic strengths (i.e.. MJ.3 M 
NaCl) that exceed those known to be physiologic. Thus, the 
data presented here indicate that the mutant can utilize the 
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presence of heparin to restore some (i.e., receptor-binding, 
tyrosine kinase activation, and procooocogene induction) but 
not all (i.e.. stimulation of ('^thymidine incorporation 
into DNA and endothelial cell proliferation) of the activities 
of the wild-type prow... : " "ly. it is of interest that the 
wild-type protein competes witn Uuei. a HBGF-I for recep- 
tor-binding and induces protooncogene expression at similar 
concentrations in the presence or absence of added heparin 
yet it requires added heparin in order to promote DNA syn- 
thesis and cell proliferation (Figs. 1, 2. 4, and 6; and data 
not shown). Thus, the relatively poor mitogen* activity of 
the mutant protein may be related to its reduced apparent 
affinity for heparin. The data presented here ckmotutrate 
that "high" affinity receptor-binding, activation of tyrosine 
kinase activity, tyrostr.*? pho^»horylation of specific sub- 
strates, and iii jt".'*jon of protooncogene expression may be 
necessary but are not. by themselves, sufficient to sustain a 
mitofctnic response to the presence of HBGF-I. These results 
are consistent with the observations of Escobedo and Wil- 
liams (12) who showed by site-directed mutagenesis of the 
PDGF receptor and cDNA trsitsfoctioo that mutants could 
be consfniutu lL* *to responsive to PDGF with respect 
to receptor tyrosine kinase acuvabon and increased phos- 
priaotylinositol turnover but did not elicit a uui of cnic re- 




fi**nrft Morpricrtc^ofWHmceUstran^ 

of the same MH JT3 celts analyzed by Western blot analysts in Ftg. 1 ^ shows ceils transacted with pSV2aeo only tad B-D show ceth 
ctyiransfected with pSV2neo and expression vectors for wiid-rype (0 and D) and mutant (O HBGF-I. The ceils ihmm in AT correspond 
to those expressing relatively high levels of HBGF-I (Fig Mane / ), whereas those shown in D correspond to (hose cmresshw relatively 

hiJe HBGF-I (Fig. a, lane J). * 1 
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POO c Sin.i'.arly. S^-rnn^v?" *t *J. (40) u;eJ 
similar methods to generate a system where the mutant re- 
ceptor could mediate an incrrjiv in c-fbs expression in re- 
sponse to PDGF but not actin reorganization or mitogenesis. 

The mitogenic deficiencirs of the mutant HBGF-I may be 
due to reduced biologicaj stability in tissue culture medium, 
reduced binding to cell surface proteoglycans, an altered in- 
tracellular stability, and/or an altered affinity for an intracel- 
lular receptor or binding protein, tt has been established that 
the presence of heparin protects HBGF-I from thermal and 
proteolytic inactivauon (28, 37). In addition, it has been 
shown that m Mabeled HBGF-I is relatively insensitive to 
lysosomal degradation after receptor-mediated endocytosis 
(14). There is no obvious difference in the susceptibility of 
wild-type and mutant HBGF-I to proteolytic cleavage by the 
conditioned media of NIH JT3 cells cultured in the presence 
of 10% calf serum. However, the relative resistance of wild- 
type and mutant HBGF-I to proteolytic modihoitiun in the . 
presence of target cells or after receptor-mediated endocyto- 
sis has not been established, tt is also possible that the mu- 
tant protein is more susceptible Chan the wild type to nonpro- 
leolytic inactivation. Further studies should reveal whethci 
the altered activities of the mutant HBGF-I are a conse- 
quence of its reduced apparent affinity for heparin. 

In summary, the data presented here demonstrate that the 
various functions of HBGF-I can be dissociated at the struc- 
tural level. The observation chat site-directed mutagenesis 
can be used to produce recombinant proteins with •normal" 
receptor-binding activity and reduced mitogenic activity in- 
dicates that similar methods could be used to produce potent 
antagonists of HBGF-I. More importantly, these results indi- 
cate that it may be possible through struciuie-functioft anal- 
ysis and site-directed mutagenesis to generate mutants that 
retain certain (i.e.. chemotactk, mitogenic, or heparin- 
binding) but not other biological functions characteristic of 
the wild-type protein. Finally, whereas the data presented on 
the receptor-binding and tyrosine kinase activation proper- 
ties of the ,)132E mutant demonstrate that a lysine residue 
at this position is not critical for these functions, it is still 
possible that methylation of a lysine at this position could 
lead to reduced receptor-binding activity of HBGF-I (19). 
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Transforming Growth Factor a: Mutation of Aspartic Acid 47 and - 
Leucine 48 Results in Different Biological Activities 
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To study the relationship between the primary structure of transforming growth factor a (TGF-a) and some 
of its functional properties (competition with epidermal growth factor (EGF) for binding to the ECF receptor 
and induction of anchorage-independent growth), we introduced single amino add mutations into the sequence 
for the fully processed, 50-amino-arid human TGF-a. The wild-type and mutant proteins were expressed in a 
vector by using a yeast a mating pheromone promoter. Mutations of two amino acids that are conserved in the 
family of the EGF-like peptides and are located in the caHwxy-lerminal part of TGF-a resulted in different 
biological effects. When aspartic add 47 was mutated to alanine or asparagine, biological activity was retained; 
in contrast, substitutions of this residue with serine or glutamic add generated mutants with reduced binding 
and colony-forming capadties. When leudne 48 was mutated to alanine, a complete toss of binding and 
colony-forming abilities resulted; mutation of leucine 48 to isoteudne or methionine resulted in very tow 
activities. Our data suggest that these two adjacent conserved amino adds in positions 47 and 48 play different , 
roles in defining the structure and/or biological activity of TGF-a and that the carboxy terminus of TGF-a is 
involved in interactions with cellular TGF-a receptors. The side chain of leucine 48 appears to be crucial either, 
indirectly in determining the biologically active conformation of TGF-a or directly in the molecular recognition 
of TGF-a by its receptor. 



Transforming growth factor a (TGF-a) is a polypeptide of 
50 amino acids. First isolated from a retro virus -transformed 
mouse celt line (9), it has subsequently been found in human 
tumor cells (10. 29), in the early rat embryo (18), and 
recently in cell cultures from the pituitary gland (23). TGF-a 
appears to be closely related to epidermal growth factor 
(EGF) structurally and functionally (19, 20). The two pep- 
tides apparently bind to the same receptor, and both induce 
anchorage-independent growth of certain nontransformed 
cells, such as NRK cells, in the presence of TGF-p (1). 

Comparison of amino acid sequences reveals about 35% 
homology among the EGF-like peptides (rat [27], mouse 
[25], and human [13] EGFs and rat (19) and human [12] 
TGF-as). Some viral peptides (Shope fibroma growth factor 
(M, vaccinia growth factor [2], and myxoma growth factor 
(30]) also share homologies with the EGF-like peptides. 

If TGF-a is involved in transformation, a TGF-a antago- 
nist could be an important therapeutic tool in the treatment 
of certain types of malignancies. An understanding of the 
conformational and dynamic properties of the TGF-a mole- 
cule is basic to the design of an antagonist. A hypothetical 
antagonist would bind to the same receptor as TGF-a, but 
would not induce the series of proliferative and transforming 
events induced by TGF-a. To obtain such a molecule it is 
necessary to dissociate interactions responsible for binding 
from those involved in signal transduction. We decided to 
approach the problem by way of site-directed mutagenesis of 
a human sequence of TGF-a. In this report we describe our 
first series of mutations, which were carried out at residues 
Asp-47 and Leu-48, in the carboxy-lerminal part of TGF-a; 
these two amino acids are highly conserved in the EGF-like 
family of peptides. We show that these two adjacent residues 
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play different roles in the structure and/or function of 
TGF-a. 

MATERIALS AND METHODS 

Cells. Normal rat kidney (NRK) cells were grown in 
Dulbecco modified Eagle medium containing 10% (vol/ vol) 
calf serum. 

TGF-a gene. The sequence of the 50-amino-acid human 
TGF-a was originally derived from a human TGF-a precur- 
sor cDN A (12). The coding sequence is preceded by an ATG 
methionine codon and followed by a TAA stop codon and is 
flanked by EcoRl restriction sites. This EcoR\ fragment 
combines the 59-base-pair EcoRl-Ncoi fragment from plas- 
mid pTE5 (12) with the 111-base-pair Ncol-EcoRl fragment 
from plasmid pyTE2 (11). The resulting EcoRI fragment was 
inserted in M13mpl8 for site-directed mutagenesis. 

Synthesis and purification of oligonudeotides and oligonu- 
cleotide-directed mutagenesis. The synthesis and purification 
of 20- to 27-nucleotide oligonucleotides were carried out as 
described previously (31). The one or two nucleotides re- 
sponsible for the mutation were located in the middle of the 
oligonucleotide. Mutagenesis was performed by published 
procedures (21, 33). The sequences of the mutant clones 
were verified by the method of Sanger et al. (25). 

Yeast shuttle vector. The vector YEp70aT contains a yeast 
a-factor pheromone promoter and prepro sequence for the 
expression of TGF-a (15). The mutant TGF-a coding se- 
quence was inserted in the EcoRI site of plasmid YEp70aT 
and expressed in the form of a fusion protein consisting of 92 
amino acids from the prepro sequence of the yeast a factor 
attached to the amino terminus of TGF-a (28). The yeast 
cleaves the precursor and secretes TGF-a with 8 amino acids 
fused to it (4 are encoded by the prepro sequence of a-factor, 
and the other 4 are encoded by the DN A sequence added to 
insert of the TGF-a gene). The last of these residues is a 
methionine, which allows the cleavage of the secreted fusion 
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pro tcujjby cyanogen bromide (CNBr) and ihe release of a 
matu^g&GF-ct (50 amino acids J (see Results). 

Yeast 7 strain and transformation; ~The yeast Saccha- 
rontyces <crc\isiae 20B-12 (MATa irpl pep4-3) (17) was 
obtained from the Yeast Genetics Stock. Center, Berkeley. 
C'd\if.:5.-cere\isiac 20B-12 was gr wn in YEPD medium {i% 
yeast .extract (Difco Laboratories I, 2% Bacto-Peptone 
[Difco|^2% glucose). When the culture reached an optical 
density at 660 nm of I. spheroplasts were prepared (14) for 
transformation. For each transformation we used 10 to 15 u.g 
of purified plasmid DNA. 

Partial purification of TGF-a mutants. At 3 days after 
transformation, rive individual colonies of transformants 
were grown to saturation in YEPD medium. The amount of 
protein in the yeast medium was measured by the method of 
Bradford (3). and Che amount of mutant TGF-a secreted in 
the yeast medium was determined by radioimmunoassay. 
The clones which secrete the highest amount of mutant 
TGF-a were used to grow a i-litcr culture in YNB-CAA 
medium (0.67% yeast nitrogen base, 20 g of glucose per liter, 
10 g of Casamino Acids [Difco] per liter). After the culture 
reached saturation (optical density at 660 nm of 10 to 12) (48 
h in an air shaker at 30°C), the yeast conditioned medium 
was dialyzed extensively against I M acetic acid in 3,000- 
molecular- weight cutoff dialysis tubing. Usually 250 ml of 
dialyzed culture was lyophilized, suspended in 10 ml of 70% 
formic acid, and treated with CNBr (molar excess of 500) for 
20 h at room temperature. The CNBr was subsequently 
evaporated, and the samples were lyophilized. CNBr-treated 
samples were suspended in 1 ml of 1 M acetic acid, loaded on 
a Bio-gel P30 column (30 by 1.5 cm (Bio-Rad Laboratories)), 
and eluted with 1 M acetic acid. Fractions of 1 ml were 
collected. Aliquots were lyophilized, suspended in binding 
buffer (minimum essential medium containing 1 mg of bovine 
serum albumin per ml and 25 mM HEPES [N-2-hydroxy- 
ethylpiperazine-/V'-2-ethanesulfonic acid; pH 7.4J), neutral- 
ized if necessary to pH 7.4. and tested in EGF-binding 
competition and soft-agar assays, as well in radioimmunoas- 
say. 

Radioimmunoassays. The amounts of TGF-a secreted in 
the yeast medium were determined by radioimmunoassay 
with the immunoglobulin G fraction of a polyclonal anti- 
body. 34D, raised against recombinant human TGF-a (4), in 
0.1 M Tris (pH 7.5MU5 M NaCI-2.5 mg of bovine serum 
albumin per ml. The amounts of partially purified TGF-a 
present in the P30 column fractions were measured by using 
the Biolope RIA kit with polyclonal antibody against human 
TGF-a (a gift from W. Hargreaves, Biotope), under dena- 
turing conditions, as recommended by the supplier. 

EGF binding competition assay and soft agar assay. Both 
EGF-binding competition and soft-agar assays have been 
described previously (1). 

RESULTS 

Rationale for mutations in the carboxyl terminus of TGF-a. 
Figure I shows the amino acid sequence of TGF-a in which 
the residues that are conserved among all the EGF-like 
peptides described thus far (EGF, TGF-a, and EGF-like 
viral proteins) are enclosed in bold circles. Among the 11 
conserved amino acids, there are 6 Cys and 2 Gly residues, 
which presumably play essential roles in determining the 
overall conformation of the molecule. We concentrated on 
the two conserved amino acids in the carboxyl terminus, 
Asp-47 and Leu-48. The Asp in position 47 is conserved 
among the EGFs and TGF-a (human or murine), but not 
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FIG. 1. Mutations in the carboxy terminus of human TGF-q. The 
amino acids conserved in all the family of EGF-like growth factors 
(human and murine EGFs and TGFs. as well as the gene products of 
the vaccinia virus [vaccinia growth factor), (he Shope fibroma virus 
(Shope fibroma growth factor}, and the myxoma virus [myxoma 
growth factorl) are enclosed in bold circles. The mutations of amino 
acids at positions 47 and 48 are indicated. Symbols: A, Ata; C, Cys; 
D. Asp; E. Gtu; F, Phe; G. Gly. H. His; I, lie; K, Lys; L, Leu; M, 
Met; N, Asn; P. Pro; Q, Gin; R, Arg; S, Sen T, Thr; V, Val; W, Trp; / 
Y. Tyr. 



among the EGF-like viral proteins (vaccinia growth factor, 
Shope fibroma growth factor, or myxoma growth factor), 
whereas Leu 48 is conserved among all the EGF-like pep- 
tides so far described. In both mouse and human EGF. the 
two corresponding residues (Asp-46 and Leu-47) are located 
near the surface of the protein (8, 22, 22a). We designed a 
series of mutations in these two positions. 

Asp-47 has been mutated to Glu, Asn, Ser, and Ala. Glu 
was chosen because it has the same charge as and a brger 
size than Asp: Asn has a similar side-chain structure, but is 
uncharged. Ser is smaller but still polar; Ala is smaller and 
nonpolar. 

Leu 48 has been mutated to He and Met, which are both 
large, nonpolar residues like Leu, and to Ala, which is 
nonpolar but smaller. We introduced the chosen mutations 
by site-directed mutagenesis of: the cloned human TGF-a 
gene, using synthetic oligonucleotides. 

Construction of the yeast a mating pheromone-hnman 
TGF-a plasmid. The TGF-a expression vector pyTEl il ig 
2) was constructed by using plasmid YEp70aT (15) which 
contains the 2u.m origin of replication and yeast TRP I gene 
for its replication and selective maintenance, respectively. 
YEp70aT also contains the yeast a-factor promoter, the 
a-factor prepro sequence coding for 89 amino acids, and the 
sequence for 3 amino acids resulting from the introduction of 
Xba\ and EcoKi sites. The human mature TGF-a sequence 
(12) is contained in a 170-base.pair EcoRi fragment which 
includes an ATG (Met) codon preceding the sequence of 
TGF-a and a TAA (stop) codon followed by 8 nucleotides. 
This TGF-a sequence was inserted in the unique EcoRl site 
of YEp70aT. Clones with the proper orientation were se- 
lected, and DNA was isolated for yeast transformation. 

Measurement of TGF-a secreted by S. certvisiae. The 
amount of total proteins secreted into the yeast culture was 
10 ± 1 ng/ml for wild-type as well as mutant TGF-a as 
determined by the method of Bradford (3). Before further 
purification was attempted, we wanted to determine whether 
the mutated TGF-a proteins were being secreted by th e 
yeast. The low pH of the yeast medium, as well as the acidic 
proteins secreted in the yeast culture, precluded biological 
assay of secreted mutants. Therefore, immunological meth- 
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MUTATION OF TRANSFORMING GROWTH FACTOR a 
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FIG. 2. Structure of the 5. cerevisiae 8.2-kilobase shuttle vector 
pyTEl. The secretion of the TGF-a gene is under the transcriptional 
control of the yeast a-factor promoter and prepro sequence <E3S ). 
The yeast 2|im origin of replication (f33) and the selective yeast 
TRPI gene (ESS ) are indicated. The TGF-a gene, preceded by an 
initiation (ATG) codon and followed by a stop (TAA) codon. is 
inserted in the £roRI site. Details are given in Materials and 
Methods and in Results. 



ods were used. Wild-type and mutant TGF-a's were se- 
creted at a level of 100 to 200 ngmil and 10 to 500 ng/ml. 
respectively (as determined by radioimmunoassay with poly- 
clonal antibody 34D). We thus estimate that the percentage 
of TGF-a secreted in the yeast culture is at least 1% of the 
loial protein secreted. We cannot yet assess whether the 
variations in the levels of secretion of different mutant 

I GF-« voteins are real or whether one single-amino-acid 
substm :.ion drastically affects the recognition by the anti- 
body. The latter hypothesis is the more likely, since the use 
of an ther polyclonal antibody (Biotope) under denaturing 
conditions enabled us to detect certain TGF-a mutants (such 
as 1 Ala 47J-TGF-a f in which the amino acid in position 47 of 
human TGF-a is mutated to an alanine) that were poorly 
detected by 34D, under nondenaturing as well as denaturing 
conditions. After the amount of TGF-a mutant proteins was 
estimated, the medium was extensively dial y zed against 1 M 
acetic id and lyophilized as described in Materials and 
Methods. 

Partial purification of yeast-secreted TGF-a. Although the 
yeast shuttle vector was constructed in such a way as to 
secrete TGF-a with 8 amino acids fused to the N terminus, 

II was often observed that a significant fraction of the 
secreted TGF-a was in a higher-molecular-weight fragment 
corresponding to the size expected from an uncleaved (un- 
processed) 92-amino-acid fusion protein. Since a Met had 
^een introduced at the N terminus of TGF-a and since 
'GF-q contains no Met in its sequence, CNBr treatment 
tould be used to cleave either of these 8- or 92-amino-acid 
germinal peptides and release the complete 50-amino-acid 

GF-a. Indeed, CNBr treatment of yeast-secreted proteins 
resulted in the conversion of high-molecular- weight TGF-a 
ine 6,000-molecuIar-weight species, as revealed by 
w estern immunoblot (data not shown). 

CNBr-cleaved samples (see Materials and Methods) were 
Purified on a Bio-Gel P30 column. Figure 3 shows the elution 
Profile i the proteins, as well as the results of a radiorecep- 
tor assay and a soft-agar assay performed on aliquots of the 
u mn fractions. The A 2g0 profile shows two major peaks of 
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FIG. 3. Purification of yeast-secreted wild-type TGF-a. The pu- 
rification procedure is described in Materials and Methods and in 
Results. Aliquots of every other fraction of the Bio-Gel P30 column 
were tested for their abilities to compete with l2S l-EGF for binding 
to the EGF receptor (A) and to induce colony formation (>62 |im) 
on NRK cells in soft agar in the presence of TGF-0 (1 ng/ml) (•). 
The A 2M profile of the proteins was determined ( ). 



eluted proteins, one corresponding to the void volume and 
the other one to proteins of molecular weight < 3,000. 
Aliquots of the column fractions were tested for their ability 
to compete with l25 l-EGF for binding to the receptor. The 
fractions that were the most active in this assay were located 
between the two major protein peaks, in an area where 
relatively few proteins eluted. Although some activity was 
found in the first protein peak (void volume), this was 
considerably reduced on treatment with stronger CNBr (data 
not shown). 

Aliquots of each fraction were also tested for their ability 
to induce anchorage-independent growth of NRK cells in 
soft agar in the presence of TGF-0 (1 ng/ml). The receptor 
binding and colony-forming activity superimposed almost 
exactly (Fig. 3). Analysis by polyacrylamide gel electropho- 
resis with silver staining, as well as by Western blot, of the 
column fractions shows that our purification procedure 
(CNBr cleavage followed by P30 sizing column) eliminates 
high-molecular-weight proteins (data not shown). Since pure 
TGF-a migrates in a broad band on sodium dodecyl sulfate- 
polyacrylamide get electrophoresis (32), this technique can- 
not be used for proper assessment of the degree of separa- 
tion of TGF-a from low-molecular-weight contaminating 
proteins. Nevertheless, within our detection levels the 
amounts of TGF-a present in the column fractions (detected 
by radioimmunoassay using the antibody from Biotope) 
correlated with the amounts observed on sodium dodecyl 
sulfate-polyacrylamide gel electrophoresis (data not shown). 

Comparison of binding and colony-forming activity of 
TGF-a partially purified from yeast media. It was important 
to show that wild-type TGF-a secreted from S. cerevisiae 
had the expected biological properties and that its activity in 
soft-agar and radioreceptor assays was equivalent. For these 
assays, the amount of EGF-competing activity present in the 
most active fraction of the P30 column of wild-type TGF-a 
was measured in terms of EGF equivalents. The dilution 
curve had a slope that was parallel to that of the EGF 
standard. This value was also used to measure the colony- 
forming activity of the partially purified wild-type TGF-a 
(with EGF as a standard in the assay). The colony-forming 
activity of the partially purified wild-type TGF-a corre- 
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TABLE I. Biological and biochemical activities of muiani Kif-^ ' 
proteins secreted by S. ccrcvisiae and partially purified 

EGF equivalence (ng/ml) in: - 



0.01 0.1 1 10 
EGF EQUIVALENT tng/m0 

FIG. 4. Correlation between the activities in the binding and 
colony-forming assay for the partially purified wild-type TGF-« 
secreted by S. cerevisiae. The activity in the radioreceptor assay of 
the peak fraction from the P30 column was determined in EGF 
equivalent concentration. The value obtained was used for the 
soft-agar assay. Colonies of >62 inn (A) and the EGF standard (•) 
are shown. 

sponded exactly to that of EGF (Fig. 4). Thus, we have 
partially purified a wild-type 50-amino-acid TGF-a showing 
the expected binding and colony-forming activities, which 
provides a reference substance for mutant TGF-as that 
might show a dissociation of binding and colony-forming 
abilities. 

Biological and biochemical activities of the partially purified 
TGF-a mutant proteins. Mutated TGF-as were expressed by 
using the yeast system and partially purified on Bio-Gel P30 
columns as described in Materials and Methods. Mutant 
TGF-as were usually obtained from two different clones of 
yeast transformants. The CNBr-cleaved samples were puri- 
fied through different Bio-Gel P30 columns for each mutant 
protein to avoid any possible contamination from one pep- 
tide to another. The purification profiles observed with the 
mutant TGF-as were similar to those obtained for the 
wild-type TGF-a. Aliquots of the P30 column fractions were 
tested in radioreceptor and soft-agar assays. For all mutant 
proteins, the highest activity in both assays was always 
found in the same fraction of the Bio-Gel P30 column effluent 
(peak fraction). Extensive purification of a series of mutant 
proteins for screening purposes is not practical. Therefore, 
we needed a quantitation system that would allow us to 
compare mutant proteins with each other. Thus, the amount 
of TGF-a present in the peak fraction was estimated by 
radioimmunoassay with an antiserum to native TGF-a (ob- 
tained from W. Hargreaves), under denaturing conditions, as 
described in Materials and Methods. All values given in 
Table I were obtained from the peak fraction. 

The controls done with the wild-type TGF-a showed (Fig. 
4; Table 1) that binding and transforming activity were 
equivalent. The yeast vector without a TGF-a insert did not 
secrete any EGF-like proteins, as determined by both radio- 
receptor and soft-agar assay. 

Two types of results were obtained upon assay of mutant 
proteins having different amino acid substitutions at Asp-47. 
In both (Ala-471-TGF-a and [Asn-471-TGF-a, binding abil- 
ity was retained. Soft-agar and radioreceptor activities cor- 
related for [Asn-47KTGF-a; there was a lower value for 



Insert in the yeasi 
expression vecior 



Radioreceptor 
assay 



AmtofTGF-c 

Soft-agar <ngftn|» i n 

assay radioimmunoassay 



Wild-type TGF-a 
None 

lAla-47|-TGF-a 

lAsn-47J-TGF-a 

[Glu-471-TGF-a 
[Ser-471-TGF-a 
(Ala-48J-TGF-a 

[Ue-481-TGF-tt 

(Met-48|-TGF-a 



700 
400 



100 
66 

80 
75 

3 

10 

0 
0 

4 

2 



700 
300 



44 
48 

72 
72 

3 

4 

0 
0 

12 

7 



2,000 

Nrr 



2 

0.5 



220 
ND 

180 
525 

42 

60 

16 
220 

470 
490 

0 * 

453 

420 



* ND. Not determined. 

colony-forming activity than for EGF-binding competition 
for [Ala-471-TGF-a. [Ser-47)-TGF-a and [Glu-47}-TGF-a 
appeared to have lower activities in both assays than either 
wild-type TGF-a or lAla-47)-TGF-a and [Asn-47|-TGF-a. 
These results indicate that neither the carboxyl charge nor 
the polarity of Asp-47 is essential for biological activity. 

The effects of mutation of Leu-48, one of the 1 1 amino 
acids perfectly conserved among all the EGFs, TGF-as, and 
viral EGF-like proteins, are dramatic. [Ala-48r-TGF-a to- 
tally lacked binding and colony-forming activity, llle- 
48)-TGF-ci and {Met-48FTGF-a had very Utile biological 
activity compared with wild-type TGF-a. Another substitu- 
tion, lMet-48r-TGF-a, resulted in a truncated mutant tacking 
the last 2 amino acids and having a substitution of Leu to 
homoserine at position 48 following treatment with CNBr. 
Alternatively, if [Met-481-TGF-a was not treated with 
CNBr, fusion proteins of TGF-a (mutated to Met in position 
48) with 8 or 92 amino acids attached at the N terminus were 
obtained. Very low activities in binding and soft-agar assays 
were found for these mutants, whether or not they were 
cleaved with CNBr. Experiments on EGF and TGF-a have 
shown that an N-lerminal extension does not markedly 
modify EGF-binding activity (12, 26). Therefore, the loss of 
activity obtained with [Met-481-TGF-a that has not been 
CNBr treated was probably due to the mutation itself and 
not to the N-lerminally extended fusion protein. We do n i 
know whether the loss of activity observed with the TGr-a 
shortened to 48 amino acids and having a substitution ot 
Leu-48 to homoserine is due only to the mutation or also t 
the lack of the last 2 amino acids. . 

The data obtained by radioimmunoassay on the partiany 
purified wild-type and mutant TGF-a show that the amount 
of TGF-a detected was always higher than the a moU " 
determined by measurement of biological activity. This may 
be due to the presence in the fraction of a certain P c . rcent ^ a 
of incorrectly folded TGF-a that might be recognized in 
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. radioimmunoassay under denaturing conditions bui would 
not be biologically active. None of the mutant proteins 
seemed to be present in amounts equivalent to those ob- 
served for wild-type TGF-a in the partially purified fractions 
(whether radioimmunoassay, radioreceptor, or soft -agar as- 
say was used for quantitation). It is not ^lear whether 
consistently less TGF-a was produced by the mutant con- 
structs than by the wild type or whether the secreted mutant 
proteins were simply less well recognized by the antibody 
Because of these uncertainties, the biological activities of 
the different mutant proteins cannot be accurately related to 
a known amount of mutant TGF-a protein. Even though 
radioimmunoassay should be used with caution for a quan- 
titative evaluation of mutant TGF-a proteins, a positive 
reaction demonstrates that immunoreactive TGF-a was 
present in the P30 peak fraction for each mutant. Therefore 
thi Tact that one of the mutant proteins ([Ala-48)-TGF-a) is 
biologically inactive can be attributed to the mutation itself 
and not to the lack of production of the mutant protein by the 
yeast or its loss through purification. However, if the mutant 
nSi^S f3Ct immu nor«»ctive as the wild type, then 
^'""^rj 0 ^ and fAsn-J7J-TGF-a are as active as wild- 
fype TGF-a and [Glu-47r-TGF-a and [Ser-47J-TGF-a are 
less active; ,n contrast. [Ile-48J-TGF-a and [Mei-48J-TGF-a 
are almost inactive. The differences between mutation of 
1-47 and Leu-** would then be even more striking. 

DISCUSSION 

TGF-a shows sequence homologies with EGF. and both 
growth factors share the same cellular receptors (20). Even 
though EGF was discovered 25 years ago (7) and its prop- 
erties have been extensively studied over the years (5) the 
binding site of EGF to its receptor has still not been 
0. crmined and the relationship between structure and 
unction of EGF/TGF-a is still to be discovered. Particu 
larly we do not know whether binding to the receptor and 
signal transduction occur through one or more domains of 
the molecule or through which amino acids. We approached 
ihe question by performing site-directed mutagenesis of 
acidVYc " J° C T, d 0U I o a " en,i0n °" 'wo »<«accnt amino 

12 ft- a " d Le J U " 48 - loca,ed in the earthy «rminus 

and highly conserved in the EGF-like family of peptides 
" ex pectedly. these two amino acids showed very different 
A.,*'! T. t, !,V. 0 ^ U i a,i0n and Particularly to a substitution to 
Ala. [ Ala-47 Y-TG F-a retained binding and colony-forming 

c !v! !« ™ her T (A ' a - 48 !r TGF - completely lost both 
activiues. These data show that Asp-47 and Leu-48 play very 

rr V 61,1 in definin8 ,he structure and/or the activity of 
lOh-a. The other mutations performed on Asp-47 were 

IaK^V 0 ASn ' SCr - 3nd ° |U - f A sn-47r-TGF-a. like 
I ' V* ,o C,ive in bindin e a " d induction of 

^"lony formation, but [Ser-47|-TGF-a and [Glu-47|-TGF-a 
v .wed weaker growth factor activities. These results indi- 
te that neither the carboxyl charge nor the polarity of 
ofThl tr7?? tM f °, r 0io,o 8 ical «'i v "y. Interestingly. ,w 0 
V kC V,ral P ro,eins - myxoma growth factor and 

^nope fibroma growth factor (6. 30). have Asn instead of Asd 
'n position 47; we have shown that [Asn-47|-TGF-a retains 
biological activity. icwins 

Substitution or Leu-48 to Met and lie led to mutant 
Prote.ns with very low activities, whereas substitution to Ala 
11 to complete loss of activity. We did not expect that a 
" 'tation of Leu to He (which have similar sizes and polari- 
"«) would cause such a strong effect. Thus. Leu-48 which 
"> conserved perfectly among all the EGF-like peptides 



seems to be essential, through its exact geometry, f r the 
biological activity of TGF-a. 

The mutant proteins tested so far. when active, showed 
parallel behaviors in binding and colony formation. Some 
mutant proteins lost all activities, and we assume that the 
binding capacity has been lost. We have not been able to 
dissociate the binding and colony-forming abilities by usine 
any of the present series of mutant proteins, and it is 

of TGF-a ,0 SCrCCn m ° rC ° f ' hem SCarch ° f an ama * on '« 
Results relating to the biological activity of EGF show that 
derivatives of mouse EGF and human EGF (EGF 1-47 
lacking the carboxy-terminal 6 amino acids as a result of 
enzymatic digestion are less potent than the intact molecule 
in mitogenic stimulation of fibroblasts, but retain full bioloe- 
ical activity in in vivo assays (inhibition of gastric acid 
secretion) (16). On the other hand, naturally 5 occurring 
truncated forms of rat EGF. which lack the carboxy-terminal 

r m a prp°,« dS <rEGF M8) are as P° ,em 35 ™>* EGF 
(mhGh 1-53) in receptor-binding and mitogenic assays (27) 
We do not know whether the discrepancies observed are due 
to the origin of the molecule (artificial or natural) or to the 
type of bioassay used. In any event, all of these EGF-related 
molecules, which are shorter than mouse or human EGF 
still retain Leu-47. W f have shown that in TGF-a the 

StJ° reSidUe ' LCU ' 48, " Cri,ical for the bM °gi<al 

cr^Z da,a .° L n ,he ,hre e-dimcnsional structure of mouse 
obtained by nuclear magnetic resonance show that 
even though Asp-46 and Leu-47 (Asp-47 and Leu-48 in 
lGh-a) are both solvent accessible (8. 22, 22a). their side 
chains point in opposite directions in the beta-sheet struc- 
ture. Therefore, the role of these adjacent amino acids in the 
structure and. consequently, the function of EGF micht be 
very different. Our data show that the amino acids Asp* 
and Leu-48 of TGF-a are not equally important for the 
biological activity of TGF-a. despite the.V conservation 
among the EGF-like peptides. From the dramatic loss™ 
biological activity which is characteristic of mutation of 

binding to the cellular receptors either by direct interaction 
tte iSS 0f PTOVidin8 ,hC Pr0per confon «Wton to 
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AFFINITY CHROMATOGRAPHY APPLICATIONS INDEX 

ISy conditions and limitations of the method. 

l^Si«h wlteaUons must be viewed as investigatory, however; you should expect to have to do some or many tnals to 
IlSrrive at the best combination of medium and conditions. 



^arrive i 
^f MpurificationyBinding ot 

^adenosine monophosphate 
^albumin 

Ifamino acids, o and l 
^antibodies 

Pavidin-labeled compounds 
if bilirubin 

^biotin and biotin-labeled compounds 

^f&philic bacteria 
< HeLa cells 
4 human erythrocytes 
mouse thymocytes 

^chorionic gonadotropin 

coagulation factors 

\ coagulation factor II 

i, plasminogen 

* plasminogen activators 
m thrombin 

Mwditiydrofolate reductase 
©endotoxins (pyrogens) 
enzymes 

; N-acetylhexosaminidase 
..." adenosine deaminase 
adenosine kinase 
alcohol dehydrogenase 
alkaline phosphatase 
aminoacyl tRNA-synthetase 
aminopeptidase 
anhydrochymotrypsin 
anthranilate synthetase 
carbohydrate-metabolizing enzymes 
carbonic anhydrase 
carboxypeptidase A 
casein kinase 
cholesterol oxidase 
orchymotrypsin 
cytochrome oxidase 
DAHP synthetase, Tyr-sensitive 
dehydrogenases 

detoxifying enzymes 
dihydrofolate reductase 
dihydroneopterin triphosphate synthetase 
enterokinase 
orgatactosidase 
0-galactosidase 



galactosyttransferase 

glucuronidase 

glutamate dehydrogenase 

giutamate synthetase 

glutathione S-transferase 

glyceraldehyde 3-phosphate dehydrogenase 

glyceraldehyde dehydrogenase 

glycogen phosphorylase b 

glycogen synthetase 

hexokinase 

histone kinase 

kallikrein 

kinases 

lactic dehydrogenase 



Suggested ligand 

acriflavine 1 
cholic acid* 
albumin" 

protein A; protein G; anti-lg; serum proteins 

biotin 

albumin" 

avidin; streptavidin 

lectins on macroagarose (agarose macrobeads) 

w-aminohexyl" 1 

lens culinaris lectin 4 * 

antkabbit IgG" 

protein A" 

concanavalin A s 

heparin" 

sulfated dextran"; heparin" 
L-lysine" 

p-aminobenzamidine" 

m-aminobenzamidine"; p-aminobenzamidme" 
Cibacron blue 3GA m 
histamine"; polymyxin B" 
' amino acids; dyes; nucleotides/cofactors 
concanavalin A* 
w-aminohexyl 1 " 
adenosine S'-monophosphate" 
Cibacron blue 3GA 1 "; adenosine S'-monophospnate" 
L-histidyldiazobenzylphosphonicacid" 

aminohexyl 1 " 
w-aminohexyl'" 
trypsin inhibitor" 
L-tryptophan" 
carbohydrates 

p-aminomethylbenzenesulfonamide" 

0- tryptophan" 
a-casein"; heparin" 
cholesteryl hemisuccinate" 
4-phenylbutylamine" 
cytochrome c" , 

1- tyrosineV ..««».,. 
nucleotides; NAD; NADP; Cibacron blue 3GA"» 
reactive red 120 11 ' 

glutathione 

methotrexate" ; Cibacron blue 3GA' 1 ' 
guanosine 5'-triphosphate" 
p-aminobenzamidine° 
o(+)melibiose 1M 

p-aminobenzyl Mhio-0-D-galactopyranoside*' 

p-aminophenylMhiogalactopyranoside 31 

p-aminobenzyl l-thio-0-D-galactopyranoside 51 

a-lactalbumin"; N-acetyl-D-glucosamine" 

saccharolactone 1 " 

w-aminopentyt 

Cibacron blue 3GA 1 

cholic acid" 

reactive blue 72; pentyl 

adenosine S'-monophosphate" 

(o-aminohexyl 12 ' ; butyl" 4 

w-aminobutyl" 1 ; uridine 5'-diphosphate T1 

N-acetyl-D-glucosamine ,M 

heparin 1 " 

trypsin inhibitor" 

nucleotides; ATP; Cibacron blue 3GA" 
0-nicotinamide adenine dinucleotide" 



For information on the use of this price list see page 7. 




CHROMATOGRAPHY 



AMINO ACID RESINS 



AM 



. General References: ^ Biol( 29t 647 (1970). 

Deutsch, D.G. and Mertz, E.T., Proc. rea. «mc j 

Deutsch, D.G. and Mertz, EJ., Science, 170, WW™ 
Vverto, M. and Vaheki, A., Biochem. J.,-183, 331 (19/*). 



imi 13.30 
5 ml 43.20 
10 ml 71.45 



A 3«5^ A |!wri2 Cross-linked 49* beaded 
IS3) agarose 

<H£» Activation: cyanogen bromide 
Attachment: amino 

Spacer. 1 atom . 
Ligand immobilized: 2-8 ^otes per ml 
Form: Suspension in 2.0 M NaCI containing 0.0294 
thimerosal 



1 ml 13.30 

5 ml 43.20 

10 ml 71.45 

25 ml 142.20 



j+> l-ALANINE 
A 3519 Matrix: 4H beaded agarose 
ERga Activation: cyanogen bromide 
Attachment: amino 

Spacer 1 atom - . 

Ligand immobilized: 2-7,imoles per ml 
Form: Suspension in 2.0 M NaCI containing 0.02H 
thimerosal 
l-ARGININE 



C7r96 L ^f4^:beadedagarose gig : 4»| 

Activation: cyanogen brom.de 10 m 82,5Q| 
Attachment: ammo « mi i0 ^ ltf a 

Ug'an^ 

: Form: Suspension in 0.5 M NaCI, 0.01 M cira 
pH 4 5, con taining 0.0 W th imerosal 

TCYSTINE H 

I C 9266 Matrix: agarose ; 



lml 

5 ml 43.201 
10 ml 71.451 
25 ml, 142.201 



/** l-GLUTAMIC ACID 

|G2759 Matrix: m beaded agarose 
Activation: cyanogen bromide 
Attachment: amino 

Spacer. 1 atom , . 

Lleand immobilized: 5-10 ^moles per m ■ 
Form: Suspension in 2.0 M NaCI containing 0.02H 

thimerosal 



lml 13.90 

5 ml 44.70 

10 ml 74.25 

25 ml 148.20 



Matrix: m beaded agarose 
A 1018 Activation: cyanogen bromide 
IS3I Attachment: amino 
Spacer. 1 atom 

Ligand immobilized: 5-10 M moles 
Form: Suspension in 2.0 M NaCI containing 0.02H 

thimerosal : ; , non 

Xml 10.80 



l-GLUTAMINE 



^ Matrix: 4H beaded agarose 
G 2884 -Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ligand immobilized: 5-10-jimoles 
Form: Suspension in 2.0 M NaCI 



lml 13.301 

5 ml 43.201 

10 ml 71.451 

25 ml 142.2M 



i 



5 ml 34.95 
10 ml 57.80 



r Matrix: Cross-linked 494 beaded 
A 8405 agarose 
reel Activation: epoxy 
Attachment: amino 
Spacer 12 atoms • 
L iffand immobilized: 1-3 fimoles per ml 

^IS* P° wder stab " ize , d w,th ,actose 

Swelling: 1 g swells to approx. 12 ml 



I ml 7133ft 
. 5 ml m% 

25 ml 142H 



i-ASPARAGINE 



1 ml 13.30 

5 ml 43.20 

10 ml 71.45 

25 ml 142.20 



Matrix: 4W beaded agarose 
A 1 143 Activation: cyanogen bromide 
rerci Attachment: amino 
Spacer. 1 atom 

Ligand immobilized: 5-10 fimoles 

FornT: Suspension in 2.0 M NaCI containing 0.02% 

thimerosal - — - 

• 13.30 



t» Matrix: Cross-linked 4% beaded 
I G 9644 agarose 

Activation: epoxy 
Attachment: amino 

Srand immc^ 0.5-1.0 ^mole per ml 
Form: Suspension in 2.0 M NaCI containing 0.02 

thimerosal : — tSTuSj 

j» GLYCINE j c m | 47.15; 

|G3009 Matrix: 4% beaded agarose^ ^ 

25 ml 155.«| 



Activation: cyanogen bromide 
Attachment: amino 

-thimerosal . — 

l-HISTIDINE 



1 ml 
5 ml 

10 ml 71.45 
25 ml 142.20 



Matrix: Cross-linked 4H beaded 
A 2047 agarose 
igrci Activation: epoxy 
Attachment: amino 
Spacer 12 atoms 

Ligand immobilized: 1-2 ^moles per ml 
Form: Suspension in 2.0 M NaCI containing 0.02H 
thimerosal 



1 ml ' 

10 ml w.»J 

25 ml 1352 



Matrix: 4H beaded agarose 
Jo 20 I H 3257 Activation: cyanogen bromide 
1 Attachment: amino 

Spacer. 1 atom 

Ligand immobilized: 2-10 jimoles 
Form: Suspension in 2.0 M NaCI containing <(| 

thimerosal _ . — r^T 

1 mi 



lml 21.20 

5 ml 68.50 

10 ml 113.80 

25 ml 227.15 



^ L-ASPARTICACID 
A 3394 Matrix: m beaded agarose 
rerrei Activation: cyanogen bromide 
Attachment: amino 
Spacer. 1 atom 

Lteand immobilized: 5-10 junoles per ml 
Form: Suspension in 2.0 M NaCI containing 0.02H 
thimerosal 



I H 0133 



Matrix: Cross-linked 494 beaded ^ 

agarose . . iumi7$ 

Activation: cyanogen bromide .10 m - 

Attachment: amino 4 

&iSl le *10.20 C o, e sper,n. ill 

Form: Suspension in 0.5 M Nau _ — 
(Continui 



S product 

'NUMBER 



(Continuation of) 
l-HISTIDINE 



% H 0767 

Pi,^"- 



Matrix: Cross-linked 494 beaded 
agarose 

Activation: epoxy 
Attachment: amino- ■ 
•Spacer 12 atoms ; 
Ligand immobilized: 1-2 pmoles 
Form: Suspension in 2.0 M NaC 
thimerosal . ■ " 



"For shipping restrictions and related costs see page 5. 



' i*. l-ISOLEUCINE 
£14505 Matrix: 4*6 beaded agarose 
a Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ligand immobilized: 3-7 pinoles 

Form: Suspension in 2.0 M Nat 

thimerosal 



,> l LEUCINE 
1 5506 Matrix: 4H beaded agarose 
] Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ligand immobilized: 2-10 pmole , 
Form: Suspension in 2.0 M NaC 

L-LVSINE \ 

it- Matrix: 4H beaded agarose c 
I& 5631 Activation: cyanogen bromide 
Attachment: amino . c 
Spacer 1 atom 

Ligand immobilized: 4-7 pinoles 
per ml ..... 
Form: Suspension in 2.0 M Nat ( 

thimerosal \ 

Oa Z Matrix: Cross-linked 4H beadec 
1 9268 agarose 

Activation: epoxy 
Attachment: amino 
Spacer 12 atoms 
Ligand immobilized: 1.5-2.5 M m 
Form: Suspension in 2.0 M Nat 
thimerosal 

Kt ... >■ Matrix: Sepharose 4B (pfs) 
£16132 Activation: cyanogen bromide 
0 Attachment: o-amino , 
Spacer 1 atom 
: Ligand immobilized: approx. 4 , 
; Form: Lyophilized powder eta 
and dextran . 

l-METHIONINE 
|J|SpH) Matrix: 4% beaded agarose' 
Activation: cyanogen bromide 
K Attachment: amino 

Wti Spacer 1 atom 

Ugarid immobilized: 2-10 
: Form: Suspension in 2.0 M Nat 

Jfc i PHENYULANINE 
S*°18 Matrix: 4H beaded agarose 
Activation: cyanogen bromide 
Attachment: amino 
:: Spacer 1 atom 
L'gand immobilized: 2-10 pmol< 
Form: Suspension in 2.0 M Na" 

For inforr 



CHROMATOGRAPHY 



AMINO ACID RESINS (continued) 



sand enzymes iswell.f,^ 



PRODUCT 
NUMBER 



I PRODUCT 
US I 1 NUMBER . 



US.S 



otein se| 



(Continuation of) 
l-HISTIDINE 



1 ml 12.80 

5 ml 41;00: 

10 ml 68.30 

25 ml 135.75 



US J 



;arose . 
bromide 



1 rfll 15:60 
5 ml : «'.70 ; 
10 ml 82.&0 ^ 
25 ml 164:10 



) 5-1.0 ftmole per ml - tr J .., 
! 0.5 M NaCI, 0.01 M citrate^ 11 
A ^ thimerosal ;J 
Inquire . 

" rml 13.30 , 

agarose 5 ml «.20 . 

; 5-10 /tmoles per m ' . n9 ^v 
in 2.0 M NaCl containing 0.0W 



i _ 
en bromide 

d: 5-10 /tmoles 
, in 2.0 M NaCI 



lml 13.30^1 

5 ml 4 .20^ 

10 ml 71.45: . 

25 mi 142.20 ;. 



ed 4H beaded 



10 



linked 4W beadea 



I+- Matrix: Cross-linked 49* beaded 
H0767 agarose 
grei Activation: epoxy 
Attachment: amino 
Spacer: 12 atoms ; • 
Ligand immobilized: 1-2 /tmoles per ml 
Form: Suspension in 2.0 M NaCI containing 0.0294 
thimerosal ' 



i-ISOLEUCINE lm 13.10 

Matrix: m beaded agarose 5 ml 41.65 

Activation: cyanogen bromide 10 ml 68.65 
Attachment: amino 25 ml 136.25 

Spacer: 1 atom 

Ligand immobilized: 3-7 /tmoles per ml 
Form: Suspension in 2.0 M NaCI containing 0.02% 
thimerosal 



5 ml \ 

10 ml n.«,^l 

25 ml 

ed: 0.5-1.0 (iraole per mi .jjjfjl 
,„in2.0NINaClconta,t.mgO.O^|j 



5 ml 

Jed agarose 78J g| 

ogenbrom.de 10 mi ^ jM 

ino " 

iied- 2-10 amoles per ml 



aded agarose 1 * 4 tjfl 

mogen bromide g' ffl| ggijl 

nnino 25 ml i3Sg| 

iHized: 2-10 /tmoles 
15 ion in 2.0 M NaCI containing^ 

"^rmT^S 

yanogen bromide ■gMS 
amino JS 1 
Sized: 10-20 ^oles per ml | 
insion in 0 : 5Mj}aCl____^> 
(Continug 

page 5. 



>- l-LEUCINE 
) L 5506 Matrix: 4% beaded agarose 
m Activation: cyanogen bromide 
Attachment: amino 
Spacer: 1 atom 

Ligand immobilized: 2-10 /tmoles per ml 
Form: Suspension in 2.0 M NaCI , 



l-LYSINE 



1 ml 14.00 
5 ml; 45.40 
10 ml 75.25 
25 ml 149.55 



Matrix: 4% beaded agarose 
■ L 5631 Activation: cyanogen bromide 
Jv. Attachment: amino 
Spacer: 1 atom 

Ligand immobilized: 4-J /tmoles 
per ml • n fl ,„, 

Form: Suspension in 2.0 M NaCI containing 0.029* 
thimerosal 



1 ml 15.70 

5 ml 51.95 

10 ml 86.55 

25 ml 173.00 

50 ml 311.60 



1 ml 15.70 

5 ml 51.95 

10 ml 86.55 

25 ml 173.00 



^flp . Matrix: Cross-linked 494 beaded 
il 9268 agarose 

a Activation: epoxy 
Attachment: amino 
Spacer: 12 atoms 

Ligand immobilized: 1.5-2.5 /tmoles per ml 
Form: Suspension in 2.0 M NaCI containing 0.029* 

thimerosal : 

>- Matrix: Sepharose 4B (pfs) Hf 46-45 

&C6132 Activation: cyanogen bromide 5 g 154.75 

Attachment: o-am ino 15 g 320.00 

Spacer: 1 atom 

Ligand immobilized: approx. 4 /tmoles per ml 
Form: Lyophilized powder stabilized with lactose 
and dextran 



tilt ^METHIONINE 
IffiPlO Matrix: 4% beaded agarose 
Activation: cyanogen bromide 
Attachment: amino 
Spacer: 1 atom 

Ligand immobilized: 2-10 fimoles per ml 
Form: Suspension in 2.0 M NaCI 



13.30 
43.20 



1 ml 
5 ml 

10 ml 71.45 
25 ml 142.20 



^PHENYLALANINE lml 15 

fc^lS Matrix: 4% beaded agarose 5 ml 48 

Activation: cyanogen bromide 10 ml 92. 

'Attachment: amino 25 ml 184. 
j*?'- 1 - Spacer I atom 

' Ligand immobilized: 2-10 /tmoles per ml 
Form: Suspension in 2.0 M NaCI 



1 ml 13.30 

5 ml 43.20 

10 ml 71.45 

25 ml 142.20 



y*. l-PROLINE 

3268 Matrix: AH beaded agarose 
Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ligand immobilized: 2-10 /tmole per ml- 
Form: Suspension in 2,0 M. NaCI containing 0.029* 
thimerosal ■ 



^ i-SERINE 

3881 Matrix: 4W beaded agarose 
I Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ligand immobilized: 2-10 /tmoles per ml 
Form: Suspension in 2.0 M NaCI 



1 ml 
5 ml 
10 ml 
25"ml 



13.30 
43.20 
71.45 
142.20 



1 ml 
5 ml 
10 ml 



12.90 
42.00 
69.40 



^ l-THREONINE 
0387 Matrix: 4% beaded agarose 
3 Activation: cyanogen bromide 

Attachment: amino 25 ml UbUo 
Spacer 1 atom - 
Ligand immobilized: 2-10 /tmoles per ml , 
Form: Suspension in 2.0 M NaCI containing 0.029* 
thimerosal . 



o-TRYPTOPHAN 
3762 Matrix: m beaded agarose , 
S Activation: cyanogen bromide 
Attachment: amino. .. 
Spacer 1 atom : ; 

Ligand immobilized: 1-3 /tmoles per ml 
Form: Suspension in 2.0 M NaCI 



1 ml 13.50 

5 ml 44.25 

10 ml 72.40 

25 ml 144.15 



1 ml 13.50 

5 ml 44.25 

10 ml 72.40 

25 ml 144.15 



l-TRYPTOPHAN 
0137 Matrix: 4% beaded agarose 
rgrci Activation: cyanogen bromide 
Attachment: amino 
Spacer: 1 atom 

Ligand immobilized: 1-3 /t.motes per ml 

Form: Suspension in 2.0 M NaCI containing 0.029* 

thimerosal 



i-TYROSINE 



1 ml 14.85 

5 ml 45.00 

10 ml 74.50 

25 ml 148.50 



>• Matrix: 4H beaded agarose 
T 0262 Activation: cyanogen bromide 
m Attachment: amino 
Spacer 1 atom 

Ligand immobilized: 5-10 /tmoles 
per ml , . . . _.. . 

Form: Suspension in 2.0 M NaCI containing 0.029* 
thimerosal . . . . 



1 ml 17.40 
5 ml 55.90 
25 ml 185.20 



Matrix: 4% beaded agarose 
T 8016 Activation: epoxy 
rci Attachment: amino 
Spacer 12 atoms 

Ligand immobilized: approx. 20 /tmoles per ml 
Form: Suspension in 0.15 M sodium phosphate, 
pH 6.8, containing 0.029* sodium azide 



1 ml 13.10 

5 ml 41.65 

10 ml 68.65 

25 ml 136.25 



*- l-VALINE 
V 5877 Matrix: 4H beaded agarose 
Ssl Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ligand immobilized: 3-7 /tmoles per ml 
Form: Suspension in 2.0 M NaCI containing 0.029* 
• thimerosal 



For information on the use of this price list see page 7. 
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BIOCHEMICALS 
ORGANIC COMPOUNDS 

DIAGNOSTIC REAGENTS 



TOLL FREE ORDERING 



ORDERING HOURS 

Monday through Friday-24 Hours a day 

Outside USA / Canada Call Collect . . • • ■ 

800-325-5052 

FAX usa /Canada 314771-5757 

Outside USA / Canada ' 

For TVYX, TELEX, Cable or Collect FAX calls see page 3. 

FOR BULK INQUIRIES CALL SAF 

See inside back cover for details. 



Customer/ Technical Service 



Customer Service^echnical Service telephone numbers located 
on card inside front cover and on page o. 



SIGMA 






o 



BIOCHEMICALS 
ORGANIC G0W1P0UNDS 

DIAGNOSTIC REAGENTS 



ms§§ 




TOLL FREE ORDERING 



ORDERING HOURS 

Monday through Friday-M Hows a day 

■■-■ : --«5«»5S 

Outside USA I Canada Call Collect 

800-325-5052 

FAX USA 'Canada 314-771-5757 

Outside USA /Canada •■• • " , ' 

For TOX, TELEX, Cable or Colled FAX calls see page 3. 

FOR BULK INQUIRIES CALL SAF 

See inside back cover for details. 



Customer / Technical Service 



Customer Service/Technical Service tetephone numbers located 
on card inside front cover and on page i. 
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CHROMATOGRAPHY 



ng the buf- 
jlumnata 
jsed to elute 
,ng effects 
required for 3 
order to have j 
: capacity 



'fer 96 



ive com- 
most listings 3|| 



AFFINITY CHROMATOGRAPHY APPLICATIONS INDEX 

Bohy conditions and limitations of the method, 
i jrrrve at the best combination of medium and conditions. 



we offer, a com- 



press, 



Kpftf Purificatiori/ Binding ol 

i'adenosine monophosphate 
i albumin 

htnino acids, o and l 
Antibodies , 
f avidin-labeled compounds 

I biotSfand biotin-labeled compounds 

halophilic bacteria 
HeLa cells 
human erythrocytes 
4 mouse thymocytes 
I chorionic gonadotropin 
i coagulation factors 
~ coagulation factor II 
J- plasminogen 
W plasminogen activators 
% thrombin 

f/dihydrofolate reductase 
I endotoxins (pyrogens) 
- enzymes 
N-acetylhexosaminidase 
adenosine deaminase 
adenosine kinase 
alcohol dehydrogenase 
alkaline phosphatase 
aminoacyl tRNA-synthetase 
aminopeptidase 
anhydrochymotrypsin 
. anthranilate synthetase 

carbohydrate-metabolizing enzymes 
; carbonic anhydrase 
carboxypeptidase A 
casein kinase 
cholesterol oxidase 
crchymotrypsin 
cytochrome oxidase 
OAHP synthetase, Tyr-sensitive 
dehydrogenases 

detoxifying enzymes 
dihydrofolate reductase 
dihydroneopterin triphosphate synthetase 
enterokinase 
ot-galactosidase 
0-galactosidase 



gatactosyttransferase 

glucuronidase 

glutamate dehydrogenase 

glutamate synthetase 

glutathione S-transferase 

glyceraldehyde 3-phosphate dehydrogenase 

glyceraldehyde dehydrogenase 

glycogen phosphorylase b 

glycogen synthetase 

hexokinase 

histone kinase 

kallikrein 

kinases 

lactic dehydrogenase 



Suggested ligand 

acriflavine 1 
cholic acid 1 
albumin" 

protein A; protein G; anti-lg; serum proteins 

biotin 

albumin" 

avidin; streptavidin 

lectins on macroagarose (agarose macrobeads) 

w-aminohexyl 11 * 

lens culinaris lectin** 

anti-rabbit IgG" 

protein A" 

concanavalin A* 

heparin 11 

sulfated dextran 14 ; heparin" 
t-lysine" 

p-aminobenzamidine M 

m-aminobenzamidine*'; p-aminobenzamidme" 
Cibacron blue 3GA 11 * 
histamine 1 *; polymyxin B 1 * 
' amino acids; dyes; nucleotides/cofactors 

concanavalin A* 
w-aminohexyl 1 " 

adenosine S'-monophosphate 7 ' > 

Cibacron blue 3GA' 1 *; adenosine S'-monophosphate 

L-histidyldiazobenzylphosphonic acid" 

w-aminohexyl 11 * 

w-aminohexyl 1 w 

trypsin inhibitor" 

L-tryptophan" 

carbohydrates 

p-aminomethylbenzenesulfonamide" 

D-tryptophan* 4 

a-casein"; heparin" 

cholesteryl hemisuccinate" 

4-phenylbutylamine" 

cytochrome c" 

L-tvrosine** 

. nucleotides; NAD; NADP; Cibacroa blue 3GA 1 " 

reactive red 120 11 * 
. glutathione 

methotrexate" ; Cibacron blue 3GA"' 

guanosine S'-triphosphate" 

p-aminobenzamidine 4 * 

ot+)melibiose 1M 

p-aminobenzyl Mhio-frD-galadopyranoside 1 

p-aminophenyl^thiogalactopyranostde'; 

p-aminobenzyl 1-thio-^galactopyranoside' 1 

a-lactalbumin"; N-acetyt-o-glucosamine" 

saccharolactone 1 " 

w-aminopentyl 

Cibacron blue 3GA' 

cholic acid" 

reactive blue 72; pentyl 

adenosine S'-monophosphate" 

w-aminohexyl 111 ; butyl 114 

w-aminobutyl 111 ; uridine 5'-diphospnate T1 

N-acetyl-D-glucosamine'" 

heparin 1 " 

trypsin inhibitor 11 

nucleotides; ATP; Cibacron blue 3GA" 
^-nicotinamide adenine dinucleotide" 



CHROMATOGRAPHY 



AMINO ACID RESINS 

Generd Reference*: ^ Exp ^ 29 , 647 (19 70). 

Deutsch, D.G. and Menz, t.i., riu*.. r nQ7m 
Detrtseh. D.G. and Mertz. EX, Scence, »»■ l 09 ^™' 
Vverto, M. and Vaheki, A., Biochem. J., 183. 331 (19/s>- 



Ar 




ftl AHlNF lml 13 30 

AsTs^Kcros^nked ^beaded S jJ «.20 

agarose 

Activation: cyanogen bromide 
Attachment: amino 
Spacer. 1 atom 

Ugand immobilized: 2-8 ,imoles per ml 
Form: Suspension in 2.0 M NaCI containing 0.02% 

thimerosal ^_ 

13.30 



lml lSi6l 
5 ml . 4978 
10 ml 82,51, 
25 ml 16U5 



1 ml 
5 ml 
10 ml 
25 ml 



i m 



l-ALANINE 
A 3519 Matrix: 4% beaded agarose 
G33 Activation: cyanogen bromide 
Attachment: amino 
Spacer. I atom 
i band immobilized: 2-7 /unoles per ml 
Fo'rm: : Suspension in 2.0 M NaCI containing 0.02% 
thimerosal 



/*■ l-CYSTEINE 
C 7896 Matrix: 4% beaded agarose 
Activation: cyanogen bromide 
Attachment: amino 

Soacer. 1 atom , 
Ligand immobilized: 0.ML.C I P mole per ml 
Form: Suspension in 0.5 M NaCI, 0.01 Nl p 
dH 4.5, containing 0.01H thimerosal 

l-CYSTINE 
I C 9266 Matrix: agarose 



Inqui 



43.20 

71.45 16 2759 
142.20 



as 



1 ml 
5 ml 

10 ml -71.4! 
25 ml. 142.2 



l-ARGININE 



GLUTAMIC ACID 

Matrix: 4% beaded agarose 
Activation: cyanogen bromide 
Attachment: amino 

Soacen 1 atom , _ ( 

Lteand immobilized: 5-10 Mmotes per m 
FormrSension in 2.0 M NaCI containing 0.0: 
thimerosal 



1 ml 13.90 

5 ml 44.70 

10 ml 74.25 

25 ml 148.20 



Matrix: 4% beaded agarose 
A 1018 Activation: cyanogen bromide 
f^vci Attachment: amino 
Spacer. 1 atom 
Ligand immobilized: 5-10 jimoles 

Form: Suspension in 2.0 M NaCI containing 0.0294 
thimerosal 



l-GLUTAMINE 



* Matrix: 4% beaded agarose 
| G 2884 Activation: cyanogen bromide 
Attachment: amino 
Spacer. 1 atom ■ 
Ugand immobilized: 5-10 jimoles 

Form: Suspension in 2.0 M NaCI 



lml 13.3 

5 ml 43.21 

10 ml 7U 

25 ml 142.2 



I r~ Matrix: Cross-linn 

H A 8405 agarose 

^1 Activation: epoxy 
Attachment: amin 
Spacer 12 atom: 
^^■H Ligand ImmobiRz 

^^^H Form: Lyophilize 

^^^H Swelling: 1 g swt 

■^^H l-ASPARAGINE 

f^^H Matrix: 4H bead 

A 1143 Activation: cyan; 



Matrix: Cross-linked 4% beaded 

&epo,y "-'"-Ml 
Attachment: amino 
Spacer 12 atoms 

Lfeandimmobinzed:l-3 P molesperml 
Form: Lyophilized powder stabilized with lactose 
Swelling: 1 g swells to approx. 12 ml 



7ml 10.80 I" * Matrix: Cross-linked 4% beaded 
5 ml 34.95 G 9644 agarose 



lml 
5 ml 
10 ml 
25 ml 



If 



Activation: epoxy 
Attachment: amino 

Spacer. 12 atoms , 
Ligand immobilized: 0.5-1.0 gnote per ml 
Form: Suspension in 2.0 M NaCI containing OA 
thimerosal 



L ml 



47.1! 



1 ml 13.30 

5 ml 43.20 

10 ml 71.45 

25 ml 142.20 



Matrix: 4H beaded agarose 
Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 
Ligand immobilized: 5-10 ^moles 

Form: Suspension in 2.0 M NaCI containing 0.02% 
thimerosal _ : , 



1 

10 ml 78. 
25 ml 155. 



^ GLYCINE 
| g 3009 Matrix: 4% beaded agarose 
33) Activation: cyanogen bromide 
Attachment: amino 

Soacer 1 atom . 
Ligand immobilized: 2-10 ^moles per ml 
Form: Suspension in 2.0 M NaCI conta.n.ng 
■ thimerosal - 



i-HISTIDlNE 



1 ml 

5 ml 43.20 

10 ml 71.45 

25 ml 142.20 



Matrix: Cross-linked 4% beaded 
A 2047 agarose 
rsrei Activation: epoxy 
Attachment: amino 
Spacer. 12 atoms 
Ligand immobilized: 1-2 ^motes per ml 
Form: Suspension in 2.0 M NaCI containing 0.029* 
thimerosal 

* l-ASPARTICACID J "J |*'|g 

A 3394 Matrix: 4H beaded agarose 5 ml 68.50 



1 ml 
5 ml 
10 ml 
25 ml 



Matrix: 4%.beaded agarose 
, H 3257 Activation: cyanogen bromide 
Attachment: amino 
Spacer. 1 atom 

Ligand immobilized: 2-10 P moles 
form: Suspension in 2.0 M NaCI containing 0j 
thimerosal 



10 ml 113.80 
25 ml 227.15 



maniA. ° ; , 

Activation: cyanogen bromide 
Attachment: amino 

Spacer 1 atom . 
Ligand Immobilized: 5-10 f moles per ml 
Form: Suspension in 2.0 M NaCI containing 0.02% 
thimerosal 



I J- Matrix: Cross-linked 4% beaded Ijjj 

,H013 ? &:cyanogenbromide .g«jj 
Attachment: amino 

Form: Suspension in^bMNau^ r 

(Continue 
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" For shipping restrictions and related costs see page 5. 



V PRODUCT 
■ NUMBER 



(Continuation of) 
l-HISTIDINE 



/+> Matrix: Cross-linked 494 bead 
7 agarose 
] Activation: epoxy 
Attachment: amino 
Spacer 12 atoms 
Ligand immobilized: 1-2 pmol 
Form: Suspension in 2.0 M h 
thimerosal 

^ i-ISOLEUCINE 
\ 1 4505 Matrix: 4% beaded agarose 
jp ESI Activation: cyanogen bromidf 
ft- Attachment: amino 
IVr Spacer 1 atom 

Ugand immobilized: 3-7 /tmol 
Form: Suspension in 2.0 M f 
thimerosal 



+ l-LEUCINE 
L 5506 Matrix: 4% beaded agarose 
afESl Activation: cyanogen bromid* 



Attachment: amino 
Spacer 1 atom 
Ugand immobilized: 2-10 *m 
Form: Suspension in 2.0 MM 



*1- ■ l-LYSINE 



^ Matrix: 4% beaded agarose 
iS31 Activation: cyanogen bromid- 
es) Attachment: amino 

Spacer 1 atom 

Ugand Immobilized: 4-7 /tmo 

per ml 

Form: Suspension in 2.0 M 1 

thimerosal 

Matrix: Cross-linked 4% bea 

9268 agarose 

SSI Activation: epoxy 
Attachment: amino 
Spacer 12 atoms 
' Ugand Immobilized: 1.5-2.5 

~* Form: Suspension in 2.0 M 

g thimerosal 

V >- Matrix: Sepharose 4B (pfs) 
6132 Activation: cyanogen bromic 

Attachment: o-amino 

Spacer 1 atom 
1? ; Ugand immobilized: approx. 
tf, Form: Lyophilized powder 
£V and dextran 

J|* l-METHIONINE 
Iff 5010 Matrix: 4% beaded. agarose 
Activation: cyanogen bromic 
Attachment: amino 
.. Spacer 1 atom 
Ligand immobilized: 2-10 *r 
Form: Suspension in 2.0 M 

^PHENYLALANINE 
E? 018 Matrix: 4% beaded agarose 
Activation: cyanogen bromi< 
Attachment: amino 
'Spacer 1 atom 
Ugand Immobilized: 2-10 ?\ 
Form: Suspension in 2.0 M 

For inl 



o 



U 



CHROMATOGRAPHY 



s and enzymes is well, 
otein separation. 



■ :.■■) 



.arose 
bromide 



lrnl 15.60. 

5 ml 49.70/. _ 
10 ml 82.50 f a 
25 ml 164.10 -^jf 



> 5-1.0 iimole per ml , 
! 0.5 M NaCl. 0.01 M crtrate/g 
OlH thimerosal 




lml 13.30.1; 
agarose 5 ml 43.p| 

5-10 itmoles per ml • 
in 2.0 MNaCI containing 0.02^1 



1 agarose 
en bromide 



d: 5-10 itmoles 
nn 2.0 M NaCl 
ed 4W beaded 



5m l 43.204 

10 ml 7J-S^4 
25 ml 142J0&J 

5 ml 43^ 
10 ml 

»o 25ml l « 

•ed: 0.5-1.0 ,unole per ml rj& 
min2.0MNaCUontamingO^ 

10 ml l™5& 
25 ml l5Sffl 



Jed agarose 
ogen bromide 
ino 



iied: 2-10 ^moles per ml 
SSn 2.0 6 Mad containing 0^ 



aded agarose JjJ <j- 

, n0 gen bromide J m , 
mino 25 m | 13! 

n 

.Ulzed: 2-10 /imoles 
lS ion in 2.0 MNaCI containing^ 



.-linked 4*6 beaded 

yanogen bromide 
amino 
Dm 



1 ml " 

5 41 
10 ml efl 
25 ml I 



JbSBzed: 10-20 ^moles per ml 
>n S i O nin0.5MNaCI^ _^. 
(Continue 



AMINO ACID RESINS (continued) 



PRODUCT 
NUMBER 



I PRODUCT 
US S I NUMBER 



US I 



(Continuation of) 
l-HISTIDINE 



1 ml 12.80 

5 ml 41.00; 

10 ml 68.30 

25 ml 135.75 



Matrix: Cross-linked 494 beaded 
H 0767 agarose 
(S3 Activation: epoxy 
Attachment: amino 
Spacer 12 atoms 
Ugand Immobilized: 1-2 ^moles per ml 
Form: Suspension in 2.0 M NaCl containing 0.0294 
thimerosal . 



P3268 



-PROLINE lm| J3.30 

Matrix: 494 beaded agarose 5 ml 43.ZU 

Activation: cyanogen bromide . 10 ml 71.45 
Attachment: amino 25 ml 14Z.ZQ 

Spacer 1 atom 

Ugand immobinzed: 2-10 ^mole per ml 

Form: Suspension in 2.0 M NaCl containing 0.02 W 

thimerosal • 



1 ml 13.10 

5 ml 41.65 

10 ml 68.65 

25 ml 136.25 



i-lSOLEUCINE 
1 4505 Matrix: 4H beaded agarose 
HS) Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ugand Immobilized: 3-7 jimoles per ml 
Form: Suspension in 2.0 M NaCl containing 0.02M 
thimerosal 



^ l-SERINE 
I S 3881 Matrix: 4H beaded agarose 
Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ugand immobilized: 2-10 ^motes per ml 
Form: Suspension in 2.0 M NaCl 



1 ml 13.30 

5 ml 43.20 

10 ml 71.45 

25 ml 142.20 



i-LEUCINE 
1 5506 Matrix: 4*6 beaded agarose 
. [HS Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ugand immobilized: 2-10 jimoles per ml 
Form: Suspension in 2.0 M NaCl 



1 ml 14.00 
5 ml: 45.40 
10 ml 75.25 
25 ml 149.55 



1ml 12.90 

5 ml 42.00 

10 ml 69.40 

25 ml 138.15 



^ ^THREONINE 
I T 0387 Matrix: 4H beaded agarose 
Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ugand immobilized: 2-10 /imoles per ml 
Form: Suspension in 2.0 M NaCl containing 0.0294 
thimerosal ■ . 



l-LYSINE 



Matrix: 4% beaded agarose 
: L 5631 Activation: cyanogen bromide 
S3) Attachment: amino 
Spacer 1 atom 

Ligand immobilized: 4-7 pinoles 
per ml . „ rtrtft , 

Form: Suspension in 2.0 M NaCl containing 0.02K 
thimerosal 



lml 15.70 

5 ml 51.95 

10 ml 86.55 

25 ml 173.00 

50 ml 311.60 



o-TRYPTOPHAN 
I T 3762 Matrix: 4H beaded agarose 
I Activation: cyanogen bromide 
Attachment: amino 
Spacer. 1 atom - 
Ligand immobilized: 1-3 jimoles per ml 
Form: Suspension in 2.0 M NaCl 



1 ml 13.50 

5 ml 44.25 

10 ml 72.40 

25 ml 144.15 



1 ml 15.70 

5 ml 51.95 

10 ml 86.55 

25 ml 173.00 



1 ml 13.50 

5 ml 44.25 

10 ml 72.40 

25 ml 144.15 



^ „ ^ Matrix: Cross-linked 494 beaded 
5 L 9268 agarose 

3 Activation: epoxy 
Attachment: amino 
Spacer. 12 atoms 

Ugand immobilized: 1.5-2.5 ^moles per ml 
Form: Suspension in 2.0 M NaCl containing 0.0294 
thimerosal 



l-TRYPTOPHAN 
T 0137 Matrix: 494 beaded agarose 
g Activation: cyanogen bromide 
Attachment amino 
Spacer 1 atom 

Ligand Immobilized: 1-3 /tmoles per ml 
Form: Suspension in 2.0 M NaCl containing 0.0294 
. thimerosal 



i-TYROSINE 



„ fc *~ Matrix: Sepharose 4B (pfs) 
RC6132 Activation: cyanogen bromide 5g 154.75 I 

a Attachment: a-amino 15 g 320.00 | 

Spacer 1 atom 

Ligand immobilized: approx. 4 ^moles per ml 
Form: Lyophilized powder stabilized with lactose 
and d extran __ 

i£ L "METHlONINE 
■ W10 Matrix: 4W beaded agarose 
Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 
£V: Ligand immobilized: 2-10 M moles per ml 
Form: Suspension in 2.0 M NaCl 



Matrix: 494 beaded agarose 
1 g 46.45 |T 0262 Activation: cyanogen bromide 



1 ml 14.85 

5 ml 45.00 

10 ml 74.50 

25 ml 148.50 



1 



1 ml 13.30 

5 ml 43.20 

10 ml 71.45 

25 ml 142.20 



Attachment: amino 
Spacer 1 atom 
Ligand immobilized: 5-10 ftmoles 
per ml . nnn/ 

Form: Suspension in 2.0 M NaCl containing 0.0294 

thimerosal . . 

1 ml 17.40 
5 ml 55.90 
25 ml 185.20 



Matrix: 4H beaded agarose 
I T 8016 Activation: epoxy 
m Attachment: amino 
Spacer 12 atoms 

Ligand immobinzed: approx. 20 ^moles per ml 
Form: Suspension in 0.15 M sodium phosphate, 
pH 6.8, containing 0.0294 sodium azide 



p«m -PHENYLALANINE lml 15.25 

K? 18 Matrix: 4% beaded agarose 5 ml 48.15 

Activation: cyanogen bromide l° m ' 92 35 

Attachment: amino 25 ml 184.00 
Spacer 1 atom 

Ugand immobilized: 2-10 P moles per ml 
form: Suspension in 2.0 M NaCl 



1 ml 13.10 

5 ml 41.65 

10 ml 68.65 

25 ml 136.25 



a?'' 1 



»» l-VALINE 
I v 5877 Matrix: 4W beaded agarose 
331 Activation: cyanogen bromide 
Attachment: amino 
Spacer 1 atom 

Ligand immobinzed: 3-7 M moles per ml 
Form: Suspension in 2.0 M NaCl containing 0.0294 
thimerosal 



For information on the use of this price list see page 7. 
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Conserved Structure of Genes Encoding Components of Botulinum 
Neurotoxin Complex M and the Sequence of the Gene Coding - 
for the Nontoxic Component in Nonproteolytic 
Clostridium botulinum Type F 

Alison K. East, Matthew D. Collins 

Department of Microbiology, Institute of Food Research, Reading Laboratory, Earley Gate, Whiteknights Road, Reading RG6 2EF, UK 

Abstract. For investigation of the genes of proteins associated in vivo with botulinum neurotoxin 
(BoNT), polymerase chain reaction (PCR) experiments were carried out with oligonucleotide 
primers designed to regions of the nontoxic-nonhemagglutinin (NTNH) gene of Clostridium 
botulinum type C. The primers were used to amplify a DNA fragment from genomic DNA of C 
botulinum types A, B, E, F, G and toxigenic strains of Clostridium barati and Clostridium butyricum. 
The amplified product from all of these strains hybridized with an internal oligonucleotide probe, 
whereas all nontoxigenic Clostridia tested gave no PCR product and showed no reaction with the 
probe. The NTNH gene was shown to be located upstream of the gene encoding BoNT, thereby 
revealing a conserved structure for genes encoding the proteins of the M complex of the progenitor 
botulinum toxin in these organisms. The sequence of the NTNH gene of nonproteolytic C 
botulinum type F was determined by PCR amplification and sequencing of overlapping cloned 
fragments. NTNH/F showed 11% and 61% identity with NTNH of C botulinum type E and type C 
respectively. 



Botulinum neurotoxin is the causative agent of botu- 
lism. It is produced by strains of four different physi- 
ological groups (1-IV), all designated Clostridium 
botulinum [7], and some strains of C barati and C. 
butyricum [9, 10]. Strains of C botulinum are classified 
into seven types, A to G, depending on the antigenic- 
ity of BoNT produced [7]. The toxin is produced by 
these organisms as a progenitor toxin complex, which 
is found in three forms: (i) M (rmm ~ 300,000) 
consisting of BoNT (rmm ~ 150,000) and a nontoxic 
protein component of approximately equal size, (ii) L 
(rmm ~ 500,000), and (iii) LL (rmm ~ 900,000). 
The larger L and LL complexes have hemagglutinin 
activity and have been purified from group I (proteo- 
lytic) C botulinum producing BoNT/A and BoNT/B 
[14] and group III C. botulinum producing BoNT/C 
(L complex only) [17]. In contrast, group II (nonpro- 
teolytic) strains of C botulinum that produce BoNT/E 
have no hemagglutinin activity, and only the M form 

Correspondence to: A.K. East 



of the complex has been detected [14]. The situation 
is similar for type A strains, which cause infant 
botulism, with only the M form being observed, with 
no hemagglutinin activity or larger complexes present 
[12]. 

The gene encoding the NTNH component, which 
together with BoNT forms the M complex, has been 
sequenced for C. botulinum type C [17]. More re- 
cently, the same workers published the sequence of 
ent-120 from C botulinum type E [5] and the same 
gene of toxigenic C. butyricum [6], both of which show 
high homology with NTNH/C. We have reported the 
DNA sequence encoding the C-terminus of a protein 
showing homology with NTNH/C, upstream of BoNT 
gene in C botulinum types A, E, and F (group II), C. 
butyricum type E, and C. barati type F [16]. In this 
study we show that the gene encoding this component 
of the progenitor toxin complex is present in all 
BoNT-producing strains, irrespective of immunologi- 
cal toxin type, and is absent from strains not produc- 
ing BoNT. Furthermore, the NTNH gene is present in 
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Table 1. Details of species and strains tested 



Physiological 

Species Strain 0 Toxin group 



C. botulinum NCTC7272 A I 

Kyoto-F A(inf) I 

NCTC7273 B I 

Eklund 17B (ATCC 25765) B II 
Hobbs FT50 (Colworth 

194) B II 

Eklund 2B (Colworth 229) B II 

Colworth 151 B II 

Scott 2129B B II 

Beluga (Colworth) E II 

Hazen 36208 (ATCC 9564) E II 

Sebald P34 (Colworth 230) E II 

VH (Dolman) E II 

Langeland (NCTC 10281) F I 
Eklund 202F (ATCC 

23387) F II 
Hobbs FT10 (Colworth 

187) F II 

Craig 610 (NCIMB 4305) F II 

ATCC 27322 G IV 

C. botulinum VPI 4088 (Nontoxic type E, — II 
(nontoxigenic) NTE) 

VPI 10428 (NTE) — n 

VPI 2093-1 (NTE) — n 

VPI 3158 (NTE) _ jj 

4672U-1(NTB) — \\ 

Cbutyricum ATCC 43755 E — 

C.barati ATCC 43756 F — 

Chastiforme DSM 5675 — _ 



fl ATCC, American Type Culture Collection; DSM, Deutsche 
Sammlung von Mikroorganismen; NCIMB, National Collection of 
Industrial and Marine Bacteria; NCTC, National Collection of 
Type Cultures; VPI, Virginia Polytechnic Institute and State 
University. 

a common location upstream of the BoNT gene. We 
also present the sequence of the gene encoding the 
NTNH component from group II C. botulinum type F. 

Materials and Methods 

Bacterial strains. The strains of Clostridia used in this study are 
listed in Table 1. 

Isolation of DNA. DNA was isolated from clostridial strains as 
described previously [4]. Plasmid DNA was purified from E. coli 
with a Magic Mini-Prep kit (Promega, Southampton, UK) or by a 
modification of the alkaline lysis method [13]. 

PCR amplification. Oligonucleotides were synthesised on an Ap- 
plied Biosystems DNA synthesiser (MODEL 391, Warrington, UK), 
and PCR was performed with a Biometra thermal cycler (Maid- 
stone, UK). PCR was performed as described previously [4] with 25 
cycles of: 92°C for 1 min; 37°C for 1 min; 58°C for 5 min, and the 
product held at 4°C Template DNA was at a final concentration of 
1 ng/u.1, except for inverse PCR, where 4 ng/p,] was used. All 
primers were used at a final concentration of 4 ng/u.1. The position 
and sequence of the primers used are shown in Figs. 1 and 3. 



Cloning and transformation. Cloning of PCR products was as 
described previously [4] or by use of a TA cloning kit (Invitrogen, 
Witney, UK) according to the manufacturer's instructions. 

Sequence determination. DNA sequencing was carried out as 
described previously [4]. Analysis of sequence data was carried out 
with Wisconsin Molecular Biology software [3] on a VAX com- 
puter. The sequence was determined on two cloned fragments 
derived from different PCR experiments. Where two clones dif- 
fered in sequence, as they did in three positions, a third, indepen- 
dently amplified fragment was cloned and sequenced. 

Southern blot analysis. Amplified DNA fragments ( - 1 u.g) sepa- 
rated on a 0.8% agarose gel were transferred to Hybond N+ nylon 
membrane (Amersham International, Amersham, UK) by a modi- 
fication of the method of Southern [13]. The DNA was fixed by 
placing the membrane on 3MM Whatman paper soaked in 0.4 m 
NaOH for 20 min. Oligonucleotide probe NN3 (5 ' -TTTAGTTTCT- 
TAGATCAATGGTGG-3') was labeled and detected with an En- 
hanced Chemiluminescence (ECL) kit (Amersham International), 
following the manufacturer's instructions. Blots were hybridized at 
42°C for 2 h and washed at 42°C or 48°C in 1 x SSC, 0.1% (wt/vol) 
SDS for 30 min. After the addition of detection agents, the blot was 
exposed to X-ray film for between 30 s and 2 min. 



Results and Discussion 

Examination of clostridial strains for presence of 
NTNH gene. The presence of sequences similar to the 
NTNH gene was detected with primers NN1 and 
NN2. NN1 was based on the sequence of NTNH I C 
gene and N-terminal protein sequence data [14], and 
NN2 was designed with the sequences of NTNH I C 
gene [18] and the region upstream of the gene 
encoding BoNT/F. PCR was performed with primers 
NN1 and NN2 and template DNA prepared from the 
clostridial strains shown in Table 1 (Fig. 1). Only 
strains that produce BoNT gave a DNA fragment 
(approx. 1.9 kb in size) in the reaction (Fig. 2). Gels of 
these fragments were blotted and probed with oligo- 
nucleotide NN3, designed to the sequence FS- 
FLDQWW (amino acids 666 to 673) of NTNH/C 
[18]. This region was considered likely to be con- 
served, and the two tryptophan residues minimized 
the need for degenerate oligonucleotides. All of the 
amplified bands hybridized to probe NN3, but to 
differing extents, presumably reflecting the degree of 
identity between the probe and the target DNA 
sequences (Fig. 2). 

Two strains producing BoNT/A were tested by 
PCR with NN1 and NN2: NCTC 7272 and Kyoto. The 
latter, isolated from a case of infant botulism, pro- 
duces BoNT with a heavy chain of altered mobility on 
SDS gel electrophoresis [12]. A very faint PCR 
product was obtained with primers NN1 and NN2 for 
strain NCTC 7272, which hybridized with NN3 (Fig. 
2), but none was visible with Kyoto DNA as a 
template, either as a PCR product after agarose 



71 

K East and M.D. Collins: Genes Encoding Botulinum Neurotoxin Complex M 



A 

(i) 



Sau3AI Sau3M 



San3Ai 

-J 



Sau3Ai 



(ii) 



PLASM1D 

pAET46* 
pA£T48* 

pAJET36 
pAET38 

pAET32 
P AET33 

pCBFl* 
pCBF2* 



PRIMERS 

' 710-698 
710-698 

NN1-NN2 
NN1-NN2 

NN3-657 
NN3-657 

164-236 
164-236 



Fi ,, Diagr a mm a, iC rep reS e„ t atio„o fOTrg e„e showing ^^^^^^^X^S^ 
for cloning fragments amplified by PCR to determine g^gj^g jj^j 0 n a n d use as the template in the reaction, 

obtained by inverse PCR, digesting the chromosomal DNA with S 0 " 3 * 1 '"^^ 
The sequence of primers used in 7— 

is indicated in Fig. 4. 



electrophoresis or by probing with NN3 (Fig. 2). 
However, with primers designed to a different region 
of the gene (NN3 and BT1, Fig. 1 (i)), both type A 
strains gave a PCR product visible after gel electropho- 
resis (Fig. 3). Sequencing PCR products amplified 
from the region upstream of BoNT/A (Kyoto) has 
revealed an orf showing homology with NTNH IC [A. 
Willems, A.K. East, and M.D. Collins, unpublished 
results]. In addition to showing different properties in 
gel electrophoresis, the two type A toxins also dif- 
fered in the antigenicity of the second component of 
the M-complex, i.e., NTNH [15]. The observed differ- 
ences in immunogenicity of the two NTNH/A pro- 
teins are probably owing to variation in the sequence 
of their genes. Nucleotide changes at one or both of 
the primer (NN1 and/or NN2) binding sites in strain 
Kyoto may account for the PCR results. 

Strains of proteolytic and nonproteolytic C. botu- 
linum producing BoNT/B gave a PCR fragment that 
hybridized strongly with probe NN3 (Fig. 2). By 
contrast, a strain 4672U-1, phenotypically resembling 
nonproteolytic C botulinum type B but producing no 
toxin, gave no PCR product [2]. Within a single BoNT 
immunological group, the efficiency of hybridization 
of probe NN3 seemed to be approximately the same 
(Fig. 2), possibly reflecting high sequence similarity in 
the NTNH genes of strains of a particular toxin type. 
In the case of BoNT genes, sequence determination 
of an ~ 1-kb region encoding part of the heavy chain, 
from several strains of the same immunological toxin 
type, showed < 0.2% sequence variation [2]. 



DNA amplified from strains of both C. botulinum 
and C. butyricum producing BoNT/E hybridized 
poorly with probe NN3 (Fig. 2). The hybridization, 
while similar for all BoNT/E-producing strains, was 
much weaker than with fragments amplified from 
DNA of any other toxin type, as shown by the lower 
temperature stringency wash required (Fig. 2c). A 
very high degree of sequence homology has been 
demonstrated for the BoNT IE genes of C botulinum 
and C. butyricum, and lateral transfer from one to the 
other could have occurred [11]. The high sequence 
homology has now been shown to extend upstream of 
the BoNT gene, including that of NTNH [6], indicat- 
ing that this DNA was probably acquired in the same 
transfer event. The poor hybridization of probe NN3 
with samples from type E strains may be explained by 
examination of the sequence of ent-UO, which reveals 
three base mismatches between that of the gene and 
probe [7]. The nontoxic strains of C. botulinum group 
II, phenotypically identical to type E except in toxin 
production, showed no PCR product or hybridization 
signal with the internal probe (Fig. 2). 

Strains of proteolytic and nonproteolytic C botu- 
linum producing BoNT/F, C. barati producing 
BoNT/F, and C botulinum producing BoNT/G gave 
PCR products that hybridized with probe NN3 (Fig. 
2). No PCR product was obtained with DNA of 
Clostridium hastiforme. 

Location of NTNH gene. To investigate the location of 
the NTNH gene, we used oligonucleotides NN3 and 
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3 4 5 6 7 



9 10 11 12 13 



14 15 16 17 18 19 20 21 22 23 24 





3 4 5 6 7 8 9 10 11 12 13 



14 15 16 17 18 19 20 21 22 23 24 




F, Hobbs FT10; 13, type F, Craig 610; (c) 14, type E, Beluga; 15, type E, Sebald P34; 16, type E, VH; 17, 'type E' nontoxic, VPI 4088- 18 'type 
E nontoxic, VPI 10428; 19, 'type E' nontoxic, VPI 2093-1; 20, 'type E' nontoxic, VPI 3158; 21, type E, C. butyncum ATCC 43755; 22 type F 
C barati ATCC 43756; 23, type G, ATCC 27322; 24, C hastiforme DSM 5675. Marker DNA used was 1 kb ladder, (ii) Blots of the gels shown 
m (l) hybridized with probe NN3 and washed at 48°C for (a) and (b), and at 42°C for (c). 

1 2 3 4 5 6 7 8 9 10 11 12 13 



kb 
I - 3.0 

- 2.0 
|- 1.8 

1.0 



Fig. 3. Agarose gel electrophoresis of PCR prod- 
ucts obtained from a reaction with primers NN3 
and BT1 (BT2 for types B and G) and template 
DNA from lane 1, type A, NCTC 7272; 2, type A, 
Kyoto-F; 3, type B, Eklund 2B; 4, 'type B' non- 
toxic, 4672 U-l; 5, type E, Beluga; 6, 'type E' non- 
toxic, VPI 3158; 7, type E, C butyncum ATCC 
43755; 8, type F, Eklund 202F; 9, type F, Lange- 
land; 10, type F, C barati ATCC 43756; 11, type G, 
ATCC 27322; 12, C. hastiforme DSM 5675. 



BT1 or BT2 (BT1 and BT2 are designed to the 
conserved region IWIIPER near the N-terminus of 
BoNT, amino acids 43 to 49 of BoNT/F, Fig. 4) as 
PCR primers with template DNA from representa- 
tive strains of each of the immunological toxin types. 



All toxin-producing strains gave PCR products of a 
similar size (-1.8 kb) (Fig. 3), corresponding to the 
fragment spanning the BoNT and NTNH genes (shown 
in Fig. l(i)). These results show that for all strains 
producing BoNT, the gene for NTNH component of 
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Sau3M 

GATCTTATACCTGCAGCTATAATAAATTCAGTAGTATATTGGTCTATGGAAAGTATCAAA 60 
DLIPAAIINSVVYWSMESIK 

GAAGCATTAGG ATTCC AATTGGC AAAT AATT.TTAC AG ATATAATAAAT AAT ATTGTTGAA 120 
EAL'GFQLANNFTDI INNIVE 

TGGAATAATATAAAAATTTCAGAAGTTACAAATGTAATATTAAATGTAGGATTTTGTATA 180 
WNNIKISEVTNVILNVGFCI 

CAAGGAAAT ATGAATTAAAQM^TG AAATATATGAAAATAAATGATGATTTAAATATAA 240 

QGNMN* MKINDDLNI (9) 

NN1 ^ ^ 698 

ATTCTCC1GTAGATAATAAAAATGTTGTAATAGTTAGAGCAAGAAAGACGAATATTTTTT 300 

NSPVDNKNVVIVRARKTNIF (29) 

710 ^ 

TTAAGGCGTTTCAAGTTGCTCCTAATATTTGGGTAGCTCCAGAAAGGTATTATGGAGAAC 360 

FKAFQVAPN IWVAPERYYGE (49) 
Sau3M 

CATTGAAT ATAAGTGATCAAGAAAAATCTGATGGTGGAATATATGATGAGAATTTTCTTA 420 

PLNISDQEKSDGGIYDENFL (69) 

AAGAAAATTCTGAAAAAGAAGAATTTTTGCAAGCAATTATTTTACTACTTAAAAGAATAA 480 

KENSEKEEFLQAI ILLLKRI (89) 

ATAATAATATCATAGGCCAAAAGCTACTATCATTGATGTGCACATCTATTCCATTTCTTC 540 

NNNIIGQKLLSLMCTSIPFL (109) 

ATGAATACAAACAAGGAGATTATAGGCAAAGTAATTATCTTGGCTCTAAAAATAGCGAAT 600 

HEYKQGDYRQ5NYLG SKNSE (129) 

ATCTCTATTCTGCAAATAT AGTTATTTTTGGTCCAGGATCAAATATAGT AAAAAATAAT A 660 

YLYSANIVIFGPGSNIVKNN (149) 

CTATTTATTATAAAAAAAATTTTGCGGAAAACGGTATGGGAACTATGGCAGAAATATTAT 720 

TIYYKKNFAENGMGTMAEIL (169) 

TTCAACCCCTTTTAACTT AT AAAT AT AATCAATTTT ATGCTGATCCTGCACTAG AATTAA 780 

FQP LLTYKYNQFYADPALEL' (189) 

TAAAATGCTTAAT AAAAGCCATATATTTCTTGTATGGAATAAAACCTAATGAT AATCTAA 84 0 

IKCL IKAIYFLYGIKPN DNL (209) 

ATATTCCTTATAGACTAAGAAATGAATTTAGTAATGTTGAATATTCAGAATTAAATATTA 900 

NIPYRLRNEFSNVEYSELNI (229) 



TTGATTTTTTAATATC AGGAGGAATTGATTATAAATTTATAAATACTAATCCTTATTGGT 960 

IDFLISGGIDYKFINTNPYW (249) 

TTATAGATAATTATTTTATTGACGTTCCAAAAGTTTTTGAAAAACATAAAAATGATTATG 1020 

FIDNYFIDVPKVFEKHKNDY (269) 



AAAT AAACATTAAAAAT AAT AGTGAAATTGGGACTAGTATTAAATTATATTTAGAGCAAA 1080 

EINIKNNSEIGTSIKLYLEQ (289) Fig. 4. Complete nucleotide sequence 

of the gene encoding the NTNH com- 

AATTTAAAACTAATGTACAAGATATATGGGAACTAAATTTAAGTTATTTTTCTAAAGAAT 114 0 ponent of the botulinum toxin complex 

KFKTN VQD I WE LNL S Y F 'S K E (309) J~S£ty^^^ 

TTCAAATTATGATGCC AG AAAAACAT AATAATGCACTTAAGC ATTATTATAGAAAAGAAT 1200 v/ 1 "^? 6 F ( 202F )* Tne position and^J 
fqimmpekhnnalkhyyrke ( 329) ^eTmence-of'prirners'used'in PCR"is 

indicated, with mismatches between 

ACTATAAAATAAATTACTCTAAACAATATGATATAAACGGCTTTGTT AATGGTCAAATTG 1260 primer and the sequence shown indi- 
YYKINYSKQYDINGFVNGQI (349) 4 . r • ~, - 

v cated by underlining. The amino acid 

CAACAAAGTTGCTTTTATCAGAAAAAAATCAATATATTATAAACAAACCCCAATTAATTA 1320 sequence of the protein is given under 
atklllseknqyiinkpqli (369) the second base of each codon. 



M-complex lies directly upstream of the BoNT gene. 
The nontoxigenic strains gave no PCR product with 
primers NN3 and BT1 (Fig. 3). 

Sequence determination of NTNH IF gene. The se- 
quence of the gene encoding NTNH/F was deter- 



mined after cloning fragments amplified by PCR from 
C. botulinum (strain 202F) chromosomal DNA (out- 
lined in Fig. l(ii)). PCR primers were designed with 
known sequence upstream of BoNT 'IF [4] and based 
on regions of NTNH/C [18] that were considered 
likely to be conserved. Inverse PCR was used to 
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TTAATTTAATAAATAAAAGT AATAATAGTTT ATTAATGAAAAGT AAC ATTTATGGAGATG 1380 

INLINKSNNSLLMKSNIYGD (389) 

GATTAAATGGAACTACAGAT AATTTTTACAGGAATTATAAAATTCCAGATAATATAGCTT 1440 

GLNGTTDN FY R '~N YKIPDNIA (409) 

ATCAATATCATCCTAATAATACTTATTTAGATAATGTAAATATTGAAGAAATAAATAATA 1500 

YQYHPNNTYLDNVNIEEINN (429) 

TTCCTCAAATTACTGATGCAGATAT ATATCC AT AT AC AAATAATTGTG AT ACTTTT ATAC 1560 

IPQITDADIYPYTNNCDTFI (449) 

CTATATATAATATAACTCAATCTAGAGAAATAAATACTACTGTACCTTATTCTATAAATT 1620 

PIYNITQSREINTTVPYSIN (469) 

ATTTGCAGTCTCAAATTATGAATAGTGACGATATTACTCTATCTTCTGATTTTTGGGAAG 1680 

YLQSQIMNSDDITLSSDFWE (489) 

TAGTTTGTTCAAATGATAAATCTTTAGTATATTCTTATTTAGATAATGTAATAAATTATT 1740 

VVCSNDKSLVYSYLDNVINY (509) 

TAGATTCTATAAAAAATAATACTCCAATTAATACAGATAAAAAGTATTACTTATGGCTAA 1800 

LDSIKNNTPINTDKKYYLWL (529) 

AAGAAATATTTAGGAACTATTCTTTTGATATTACAGCAACTGAAGAAATTACTACTGAAT 1860 

KEIFRNYSFDITATEEITTE (549) 

GTGGAATTAATAAAATAGTATCTTGGTTTGGAAAAGCTTTAAATATTTTAAATACAGATA 1920 

CGINKIVSWFGKALNILNTD (569) 

ATTCTTTTAAAATAGAATTTCAAAATTCTGGGGCAATTGCTCTTATTAATAAAAAAGACA 1980 

NSFKIEFQNSGAIAL INKKD (589) 

ATATAATCATTCCAAAAATAGAGATTGATGAAATGCCTAATAGTATGTTAAATTTATCAT 2040 

NIIIPKIEIDEMPNSMLNLS (609) 

TTGAAGATTTAAATGAACAGTT ATAT AGTAT ATATTCT AAAAATATAACTTACTTTAAAA 2100 

FEDLNEQLYSIYSKNITYFK (629) 

NN3 ^ 

AAATCTATTATAATTTCTTAGATCAATGGTGGACAGAAT ATTATAGTCAATATTTTGATT 2160 

KIYYNFLDQWWTEYYSQYFD (649) 

^ NN2 

TAATTTGTATGGCAAAAAAATCAATATTAGCACAAGAAAATTTAATCAAAAAAATAATAC 2220 
LICMAKKS ILA QENLI KKI I (669) 

AAAAGAAAATAAGCT ATTT AATTGGGGCATCTAATATACCTGATGATATATT AGCAGTAA 2280 
QKKISYLIGASNIPDDILAV (689) 

TGAGATTGACAACAACTAAT ACTTT AAGAGAT AT ATCAGTCGAATC ACAAATAGC AATGA 234 0 
MRLTTTNTLRDISVESQIAM (709) 

ATAATTTAAAT AATTTTTTAAAT AAAGCGGCTATGTGTGTGTTTCAAAGTAAT ATTTATC 2400 
NNLNNFLNKAAMCVFQSNIY (729) 

CTAAATTTATCTCTTTCATGGAACAATGTATTAAGCATATAAACAAAAGCACTAAAGAGT 24 60 
PKFISFMEQCIKHINKSTKE (749) 

TTATACAAAAATGTACTAATATTAATGAAACTGAAAAATTACAATTGATTATGCAAAATA 2 520 
FIQKCTNINETEKLQLIMQN (769) 

GTTTTAGT AATTTAGATTTTGATTTTTT AGATATTCAAAATATGAAGAATCTATTTAACT 2580 
SFSNLDFDFLDIQNMKNLFN (789) 

CAT AT ACAGAACTACTTATAAAAGAACAAACCTCACCCTATGAATTATCATTATATGCTT 2 64 0 
SYTELLIKEQTSPYELSLYA (809) Fig. 4. Continued. 



obtain the extreme 5 '-end of the gene and upstream 
sequences. 

The complete nucleotide sequence of the 
NTNHIF gene is given in Fig. 4. The gene encodes a 
protein of 1165 amino acids with a calculated rmm of 
136,525. The NTNH gene ends 14 nucleotides before 
the start of BoNT/F. For other BoNT types the 



distance between these two genes varies, e.g., 43 
nucleotides for type A [1]. Directly upstream of the 
start of NTNHIF gene there are no recognized - 10 
and -35 sequences. Further experiments need to be 
performed before conclusions can be drawn on what 
sequences form the promoter, the start of the mRNA, 
and whether the message is mono- or poly-cistronic. 
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TTGAAGAACAAGATAACAATGTTATAGGAGATGCATCAGGT AAAAATACATTAGTAGAAT 2700 

FEEQDNNVIGDASGKNTLVE (829) 

ACCCTAAAGGAATAGAATTAGTTTATGGAATAAATAATAGTGCATTATATTTAAATGGAT 27 60 

YPKGI ELV Y.-G INNSALYLNG (849) 

CTAATCAAAGTATAATTTTTACCAACGATTATTTTGAGAATGGACTAACTAATAGTTTTT 2820 
SNQSI IFTNDYFENGLTNSF' (869) 

CAATTTATTTTTGGTTGAGAAATTTGGGCCAAGATACTATTAAATCTAAGTTAATAGGTA 2880 
SIYFWLRNLGQDTIKSKLIG (889) 

GTAAAGAATATAATTGTGGTTGGGAAATTTATTTTCAAGAGATTGGGCATGTTTTTAATA 2940 
SKEYNCGWEIYFQEIGHVFN (909) 

TGATAGATTCTAATGGAAATGAGAAAAATATATATCTATCCGATGTTTCTAATAATAGTT 3000 
MIDSNGNEKNIYLSDVSNNS (929) 
SaulM 

GGCACTATATAACTATATCTGTGGATCGTTTAAAAGAACAATTATTAATATTTATTGATG 3060 
WHYITISVDRLKEQLLIFID (949) 

AT AATTTAGT AGTTAATGAAAGTATTAAAGATATTTT AAAT ATTTATTCAAGTAATATAA 3120 
DNLVVNESI. KDILNIYSSNI (969) 

^ 657 

TTTCTTTATTAAGTGATAATAAGGCAAGTTATATTGAGGGATTAACTATTTTAAATAAAC 3180 4 
ISLLSDNKASYIEGLTILNK (989) 

CT ACT AC AGGTG AAG AAGTTTTG AGAAATT ATTTTAAG AACCT AAAT AATTC AT ATGT AA 3240 

PTTGEEVLRNYFKNLNNSYV (1009) 

GAGATAGTAATGATGAACGATTAGAATATAAT AAGACATATCAATTATATGATTATGTAT 3300 

RDSNDERLEYNKTYQLY'DYV (1029) 

TTCCAGATAATCCTATATGTGAAGTTAAACAAGATAATAATATATATTTAACAATTAATA 3360 

FPDNPICEVKQDNNIYLTIN (1049) 

ATATAAAT AATTTAAATATGAAACCTTGTAAATTT AAATTATTAAGT ATT AATTC AAAT A 34 20 

NINN'LNMKPCKFKLLSINSN (1069) 

AACAATATGTTCAGAAATGGGATGAAGTAATAATATCTGTATTATATGAT ACAGAAAAAT 34 80 

KQYVQKWDEVI I SVLYDTEK (1089) 

ATGTATGTATATCT AATG AAAATAAT AGAGTTAAAATAAT AGATAATAAAAT AATGCAAG 3540 

YVCISNENNRVKI IDNKIMQ (1109) 

TAAAATTTATTATTAGTAATGATATATTTATTTCAAATTGTTTAACGCATGCTCATAACA 3600 

VKFI ISNDIFISNCLTHAHN (1129) 

ATAAGTATATATGTTTATCTATGAAAGATGAGAACTATAATTGGATGATATGTAACAATG 3660 

NKYICLSMKDENYNWMICNN (1149) 

AAAGCAACATACCTAAAAAGGCATATTTGTGGATATTGAAAGAAGTATAGGGGGGATTTT 3720 

ESNIPKKAYLWILKEV* (1165) 

TATGCCAGTTGCAATAAATAGTTTTAATTATAATGACCCTGTTAATGATGATACAATTTT 3780 
MPVAINSFNYNDPVNDDTIL (20) 

236 164 

ATACATGCAGATACCATATGAAGAAAAAAGTAAAAAATATTATAAAGCTTTTGAGATTAT 3840 
YMQIPYEEKSKKYYKAFEIM (40) 

~* BT1 

^ BT2 5om3AJ 



G££TAATGTTTGGATAATTCCTG AGAGAAATACAATAGGAACGAATCCTAGTGATTTTGA 3900 

RNVWI IPERNTIGTNPSDFD (60) 

TCCACCGGCTTCA BoNTIF 3913 

P P A S (64) Fig. 4. Continued. 



Upstream of the genes encoding NTNH/F, Ent-120 
(NTNH/E), and NTNH of C. butyricum there are 
sequences which could be the 3 '-end of an open 
reading frame [Fig. 4]. These nonproteolytic strains 
of C. botulinum do not produce hemagglutinin, the 
gene for which is found immediately upstream of 



NTNH/C [18], and the sequence encoded here shows 
no homology to the hemagglutinin of type C [17]. 
Since type E and F DNA sequences are homologous 
in this region upstream of genes encoding the compo- 
nents of botulinum protein complex M, the sequences 
could also be concerned with the BoNT-complex, 
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NTNH/C . -T^™—™^^ 

NTNH/E LI STAIPFPYENNT GOYTOSHTflGSKHSEYLYSANIVIFGffCSHIVKHIlTIlfYKKNFAENGMCTMAE 

NTNH/F LMCTSIFFLHEYKQ 



NTNH/F DYEINIKN»SEIGTSI!a;.YLEQKFKWVQDI«^»«"V-™^n„. 

NTNH/C KVVNLVNE.N*™^^^ 

NTNH/C ALS^^T^ENFTLSSOrsm— « = ™ 

NTNH/C II^SNSm^DSGAISLISKKDHLREPNIEIDDISM^ 

ntnh/c v^vcNKrTDLSKAs.iPPorau.™^^ 

NTNH/C ^SVTTKTi™^!™—^ 

-^—.T^C^YTEDNGLVFE^ 
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NTNH/ba I* . xmmftfr 
, . • j e n( wTMH/r NTNH/E and NTNH/F, and partial amino acid sequences ot N INnot C 

Fig. 5. Multiple alignment of amino acid sequences of NTNH/C, NTNH/b, ana w i rm r p 
bamti [16] and C. botulinum type A (translated from [1]). Ammo acids that are identical in all proteins are shown bold. 

• • t a - f^otinn nf Table 2. Percentage similarities (lower left-hand triangle) and 
perhaps encoding proteins involved m formation Ot identities (upper right-hand triangle) of (i) NTNH and (ii) BoNT 

the complex or control of expression of the constitu- 

amino acid sequences 

ent proteins. 

Comparison of NTNH/F with other NTNHs. A mul- 



(i) NTNH (ii) BoNT 



tiple alignment of NTNH/F, NTNH/C, and NTNH/E Type c E F Type C E F 

r & . ,™rrm.i.„.-.'Tiflt:^«H-h, ~ _ 57 9 60.7 C - 34.5 34.5 

75 0 - 71.2 E 56.0 - 62.6 



77.4 — 



is shown in Fig. 5. The NTNH/F shows 71% identity c 

with NTNH/E and 61% identity with NTNH/C e 75.0 - 7U E 56.0 

(Table 2). For types C, E, and F the NTNH compo- F 761 [ 

nent of the toxin complex is more highly conserved 

than BoNT (Table 2). This is perhaps surprising when mtmh/p fFie yi From PCR results 

the proposed function of the two proteins is cons.d- J^^t^I^^U^pe. 

erediBoNTisahighlyspecificandpotentneurotox.n it ^* a J^. ™J ^ as p P CR ducts g m 

while the suggested function of NTNH ,s to protect E and F ^Jf G d ^^ large P r (Fig . 2) . A 

the toxin from proteolytic degradation [15]. types a, o, showe d that BoNT/E 

NTNH/F and NTNH/E have a common deletion comparison of BoNT sequences showed tnat boin / 
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A RoN T/F are more closely related to each other 
if to other toxin types [8].- The high sequence 
* L of the NTNH components may be sigmfi- 
St-siS the evolution of the botulinum 

^results presented here show that all BoNT- 

i \ the aenes of the two components ot the m 
tZ E^rally linked. Since only some of 

t t% forms of the progenitor complex, it will oe 
^reS to compare the structural maps of genes 

only the M-complex. 
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B. R. DasGupta and S. Rasmussen. Amino acid composition of Clostridium botulinum type ¥ 
neurotoxin. Toxicon 21, 566-569, 1983. — To develop reliable data on the amino acid composition 
of type F botulinum neurotoxin, three batches of the neurotoxin were analyzed. Each batch was 
isolated from a separate neurotoxin producing bacterial culture. Two batches had inoculum from 
one source and the other batch one from a different source. Two batches of the neurotoxin were 
purified by the same method and one was purified by a different method. The neurotoxin preparations 
were found comparable in purity and similar in amino acid composition. The best estimate of number 
of amino acid residues per neurotoxin molecule (mol. wt. 155,000) was: Aspi, 8 Thr M Ser 10 $ Glu, 
Pro 4 r GIy„ A!a 4 T Val„ CyS, Met 14 Ile ll8 Leu IM Tyr B6 Phe 60 Lys M His,, Arg„ Trp«. 



The SOLE cause for the neuroparalytic food-borne disease botulism is botulinum 
neurotoxin, a protein produced by Clostridium botulinum. The neurotoxin is found in 
several antigenically distinguishable forms; types A, B, d, D, E, F (Sugiyama, 1980) and/ 
the recently discovered type G (Lewis et ai> 1981). The type F botulinum neurotoxin is 
synthesized, like types A, B, C, D and E, as a single chain protein (mol. wt. ~ 15j0,000) 
and depending on the physiology of the anaerobic bacteria is cleaved (nicked) endogenously 
in the bacterial culture into a dichain (nicked) molecule. The two chains (mol. wt. 100,000 
and ~50,000) of the nicked molecule are held together by at least one disulfide bond. 
Nothing more is known about the structure of type F neurotoxin, although studies on the 
structure and structure -function relationship of type A, B, C, and E have progressed 
(DasGupta, 1981; DasGupta and Sugiyama, 19770; Syuto and Kubo, 1981). Here we 
report on the amino acid composition of type F neurotoxin isolated by two different methods 
(Yang and Sugiyama, 1975; Ohishi and Sakaguchi, 1975). | 

Type F neurotoxin was purified in three separate batches. Stock cultures of Clostridium 
botulinum type F (strain, Langeland) were: (i) from the Food Research Institute (used in 
a previous study, DasGupta and Sugiyama, 19776), maintained in the toxin production 
medium at 8 n C and used to produce batches #1 and #3 of neurotoxin; (ii) kindly supplied 
by Dr. Lynn S. Siegel, U.S. Army Medical Research Institute of Infectious Diseases, 
Frederick, MD, in cooked meat medium and stored at 8*C, which was used for batdh #2. 
Each batch of neurotoxin was produced in 15 liters of medium of identical composition 
(Yang and Sugiyama, 1975) and incubated for 5 days at 30°C following inoculatioli with 
1.0 ml of a stock culture. Batches #1 and #2 were purified by one method (Yan^} and 
Sugiyama, 1975) and batch #3 by another method (Ohishi and Sakaguchi, 1975). Purity 
of the neurotoxin preparations was assayed (see discussion in DasGupta and Sugiyama, 
1977a) by electrophoresis in 5% polyacrylamide gels in the presence of 0.1 sodium dodecyl 
sulfate (PAGE-SDS) (Weber and Osborn, 1969). 
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For amino acid analysis, procedures described by MOORE and Stein (1963) and MOORE 
(1972) were used. Cysteine and cystine were determined as 1/2 cystine (i.e. cysteic acid) 
following performic acid oxidation and 24 hr hydrolysis in 6 N HO. Tryptophan was 
determined from the absorbance of the protein dissolved in 6 M guanidine-HCl 
(Edelhoch, 1967). The Durrum D-500 automatic amino acid analyzer was calibrated with 
standard amino acids (10 nmoles each) immediately before and after analyzing the/ 
hydrolyzed protein samples. The exact detaiis of the above procedures followed and 
calculation of the number of amino acid residues have been described (DasGupta and 
Rasmussen, 1983). Molecular weight of the neurotoxin, based on the PAGE-SDS 
technique, was reported to be 155,000 (Yang and Suoiyama, 1975) and 157,000 ^OHISHI 
and Sakacuchi, 1975). We used the lower value to calculate the number of amino acid 
, residues. 

Amino acids recovered from one of the three batches of the neurotoxin (batch 3) following 
24, 48 and 72 hr HC1 hydrolysis, performic acid oxidation - HC1 hydrolysis, krid the 
calculated ratio of tryptophan to tyrosine are tabulated in Table 1. No unusual peak was 
found on any of the chromatograms. The calculated number of amino acid residues per 
neurotoxin molecule (mol. wt. 155,000) from batches 1, 2 and 3 are given in Table 2. 
Agreement in amino acid content among the three batches was as follows (the ejrror on 
the Durrum D^SOO analyzer is ± 3%): nine amino acids (Asp, Pro, 1/2 Cys, Met, lie, Leu, 
Phe, Lys, Arg) were < 3%; six amino acids (Thr, Ser, Glu, Ala, Tyr, Trp) weri > 3% 
but < 5%; three amino acids (Gly, His, Val) were > 5%. In those cases where one batch 



Table I. Amino acid analysis of Botulinum neurotoxin type F (batch #3) 



HCI hydrolysis 



Mean 



Minimum 
number of 
residues 



Amino acid 


Performic acid 


24 hr 


48 hr 


72 hr 


(Lys = 100) (His = 1) 


Aspartic acid 


70.24 


85.92 


42.44 


29.79 


29.37 


240.36 1 


7.66 


Threonine 


23.21 


37.09 


15.15 


10.42 


9.90 


90.05* 


6.62 


Serine 


26.85 


34.18 


19.69 


13.56 


12.02 


116.7* 


8.57 


Glutamic acid 


38.18 


46.37 


22.76 


16.12 


15.95 


129.83 


9.54 


Proline 


14.64 


17.96 


8.78 


6.09 


6.27 


50.06 


3.68 


Glycine 


20.67 


24.24 


11.60 


8.25 


8.10 


66.18 


4.86 


Alanine 


15.46 


18.76 


9.16 


6.44 


6.29 


51.77 


3.80 


Valine 


17.53 


21.72 


11.37 


8.60 


9.00f 


73.11+ 


5.37 


Cystine 


0.40 


0.26 


0.42 


0.22 


0.13 






Methionine 


0.04 




2.70 


1.89 


1.80 


15.09 


1.11 


Isoleucine 


35.89 


43.46 


22.19 


15.46 


17.06f 


138.59+ 1 


0.18 


Leucine 


34.09 


42.02 


19.52 


12.89 


13.79 


111.944: 


8.22 


Tyrosine 


0.04 


0.03 


16.96 


12.05 


12.14 


97.53 


7.17 


Phenylalanine 


18.53 


22.27 


11.55 


8.12 


8.14 


65.85 


4.84 


Ammonia | 
















Lysine 


28.64 


35.23 


17.45 


12.45 


12.31 , 


100 


7.35 


Histidine 1 


0.69 


0.58 


2.32 


1.71 


1.70 


13.61 


1.00 


Arginine , 


16.08 


19.98 


9.97 


6.96 


6.90 , 


56.36 


4.14 


X A Cystine j 


2.77 


3.72 








, 10.12§ 


0.74 


Tryptophan j 












\ 26.241 


1.93 



Columns under performic acid, 24, 48 and 72 hr: nmoles of amino acids recovered from 40 pi hydrolysate. 
• Thr and Ser extrapolated to zero time (Ser 72 hr value not included). | 
+ Val 72 hr value (at 24 and 48 hr, 65.16 and 69.08, respectively) slow release. He 72 hr value (at 24 and 48 
hr, 127.16 and 124.18, respectively) slow release. 
+ Leu mean of 24 and 72 hr values, 111.86 and 112.02, respectively (48 hr value, 103.53, not included). 
§ Vi Cys mean of two values, 9.67 and 10.56, from duplicate runs. 1 
\ Based on tryptophan: tyrosine = 0.269 (from batch 1). 
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Table 2.! Number of amino acid residues per type F Botulinum neurotoxin (mol. wt. 



155,000) 



Amino acid 



Batch 1 



Batch 2 



Batch 3 



Mean f three Best of three 



Aspartic acid 
Threonine 
Serine 
Glutamic acid 
Proline 
Glycine 
Alanine 
Valine 
Vi Cystine 
Methionine 
Isolcucine 
Leucine 
Tyrosine 
Phenylalanine 
Lysine 

Histidine { 
Arginine 1 
Tryptophan | 
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124 


128 


127 


128 / 


104 


104 
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60 
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90 
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92 


91 


90 


10 


13 


13 


12 


13 


51 


51 


52 


51 


51 


23 


23 


24 


23 


23 



Each batch of neurotoxin was acid hydrolyzed for 24, 48 and 72 hr. 

♦1/2 Cys in batches 1 and 2 were based on the value from batch 3 (run in duplicate). 

Maximum deviation from the mean: 

< 3% ; Asp, Pro, Vi Cys, Met, lie, Leu, Phe, Lys, Arg; 

> 3% but < 5% Thr. Ser, Glu, Ala, Tyr, Trp; 

> 5<V Gly. His, Val. 



yielded significantly high or low values (i.e. beyond the instrument error), very good 
agreement was found between the other two batches (the last column in Table 2). There 
is no consistent difference among the three batches that can be ascribed to the source of 
the stock bacterial culture or method of purification. J 

The type F botulinum neurotoxin independently purified in two laboratories pfANG and 
Sugiyama, 1975; Ohishi and Sakaguchi, 1975) have not previously been compared. We 
found that the neurotoxin purified by these two methods is (i) comparable ih purity (as 
judged by PAGE-SDS) and (ii) similar in amino acid composition. This observation, 
seemingly trivial, is indeed significant, because the literature on botulinum neurotoxin is 
replete with claims of purification and characterization of types A, B and E neurotoxin, 
many of which are inconsistent and irreproducible (see discussions in DasGupta and 
Sugiyama, 1977a; DasGupta and Rasmussen, 1983). 
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Organization arid Phylogenetic Interrelationships of Genes Encoding 
Components of the Botulinum Toxin Complex in Proteolytic 
Clostridium botulinum Types A, B, and F: Evidence of 
Chimeric Sequences in the Gene Encoding the t 
Nontoxic Nonhemagglutinin Component 

ALISON K. EAST,* MANJU BHANDARI, JOANNA M. STACEY, 
KATHRYN D. CAMPBELL, and MATTHEW D. COLLINS 

Department of Microbiology, Institute of Food Research, Reading, 
Berkshire RG6 6BZ, United Kingdom 

The cluster of genes encoding components of the botulinum neurotoxin (BoNT) complex was mapped in 
proteolytic (group I) Clostridium botulinum strains encoding BoNT types A, B, and F. Two different arrange- 
ments of genes were found: type A strain 62A and type B strain NCTC 7273 have similar organizations of genes 
encoding BoNT, the nontoxic nonhemagglutinin component (NTNH), hemagglutinin components, and P-21; 
type F strain Langeland has genes encoding BoNT, NTNH, and P-21, and a previously unidentified open 
reading frame encoding a protein of 416 amino acids. A group of type A strains typified by infant strain 
Kyoto-F, which is unlike type A strain 62A, lacks genes for hemagglutinin components and exhibits an 
organization similar to that of type F. Sequencing and pairwise analysis revealed the presence of possible 
chimeric sequences in some NTNH genes of proteolytic C. botulinum. Discordance in genealogical trees derived 
from different regions of the NTNH genes was observed which could be symptomatic of recombination and 
which may indicate that the NTNH gene represents a hot spot for such events within the cluster of genes 
encoding the BoNT complex. It is also evident that the phylogenetics of the NTNH gene, which is linked to the 
gene encoding BoNT, does not mirror the evolutionary history of the BoNT, upon which the C. botulinum 
species complex is defined and subdivided. 



Strains of the species Clostridium botulinum produce the 
extremely toxic botulinum neurotoxin (BoNT). BoNT is the 
causative agent of the severe neuroparalytic illness in humans 
and animals referred to as botulism. Bo NTs are classified into 
seven different types (A through to G) by reaction with specific 
antisera. C botulinum strains are traditionally divided into four 
physiological groups, which are designated I to IV (18). Pro- 
teolytic group I C botulinum strains produce a single toxin or, 
occasionally, dual toxins of type A, B, or F. Nonproteolytic 
group II C. botulinum strains invariably produce a single toxin 
of type B, E, or R Group III strains produce BoNT type C or 
BoNT/D and are responsible for animal botulism, while group 
IV strains produce BoNT/G. Strains of groups I and II, espe- 
cially those of types A and B, are important agents of both 
food-borne and infant botulism (18). In addition, some strains 
of the non-C. botulinum species are known to synthesize BoNT 
(viz., Clostridium barati [type F] and Clostridium butyricum 
[type E]). Recently, the evolutionary interrelationships of G 
botulinum types A to G, C barati type F f and C butyricum type 
E and their BoNT genes have been systematically investigated. 
16S rRNA sequencing revealed the presence of four distinct 
phylogenetic lines within C botulinum which correlate with the 
four physiological groups (I to IV) (22). The same studies 
revealed that neurotoxigenic C barati and G butyricum strains 
are authentic members of their respective species and are 
genealogically distinct from each other and the four G botuli- 
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num lines (22). However, major differences in the pattern of 
relationships inferred from 16S rRNA and BoNT gene se- 
quences were observed that were indicative of BoNT gene 
transfer between C botulinum rRNA lines and the non-C. 
botulinum species (3, 10, 20). 

BoNT is produced by G botulinum as a noncovalently bound 
complex of two or more protein components. The smallest 
complex, the M complex (ca. 300 kDa), which is found in all 
strains (except type G) which produce neurotoxin, consists of 
BoNT with a component that has a similar size (ca. 150 kDa), 
the nontoxic nonhemagglutinin component (NTNH) (33). The 
size of the complex formed by G botulinum shows some cor- 
relation with the toxin type encoded; strains encoding types E 
and F (26) and some type A strains (30) produce only the M 
complex, while strains encoding types A, B, C, and D, in ad- 
dition, produce L complexes (ca. 500 kDa), which have asso- 
ciated hemagglutinin (HA) activity (28). Type G strains pro- 
duce only the L complex (25). Many type A strains produce not 
only the M and L complexes, but also a larger LL complex (ca. 
900 kDa). The purported function of the nontoxic components 
of the complex is to increase the stability of BoNT in the 
gastrointestinal tract (29). The gene encoding NTNH is lo- 
cated immediately upstream of that of BoNT in all toxin types 
(6), and genes for other components of the complexes have 
been shown clustered upstream of the NTNH gene in strains 
encoding types A and B (8) and type C (15, 19, 32, 33). 

Recently, we showed that two distinct populations of G 
botulinum type A'strains exist, encoding distinct BoNT/A genes 
(designated BoNT/Al and BoNT/A2) which can be distin- 
guished from each other by DNA probes and by restriction 
fragment length polymorphism analysis (4). Type Al and A2 
strains encode BoNTs which show ca. 89% sequence identity 
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(1 31 35). These strains also differ in the composition of the 
progenitor toxin complex produced and the organization of 
genes in the toxin cluster (4). It has been shown that approx- 
imately 50% of C. botulinum type A strains contain unex- 
pressed BoNT/B gene sequences (12). In the C. botulinum 
strains thus far examined, those which carry additional 
BoNT/B genes are invariably of type Al (4). To date, knowl- 
edge of the. gene organization and evolutionary interrelation- 
ships of the various progenitor toxin components has not been 
systematically examined. Since genes encoding these complex 
components are physically adjacent to genes responsible for 
BoNT production, it is not unreasonable to expect that their 
genealogical relationships would mirror that of the neurotoxin. 

In this paper, we report elucidation of the organization of 
the cluster of genes encoding components of the BoNT com- 
plex in group I C botulinum type A, B, and F strains. We 
report the finding of two distinct gene organizations: one for 
types Al and B and another for types A2 and F. The complete 
sequence of the NTNH gene is presented for strains encoding 
BoNT types Al, A2, and B, and genealogical analyses provide 
evidence for chimera-like sequences in some genes. Discor- 
dance in trees derived from different regions of the NTNH 
genes, which is symptomatic of recombination, was observed; 
indicating that the NTNH gene may be a hot spot for such 
events within the cluster of genes coding for the BoNT com- 
plex. 

MATERIALS AND METHODS 
Bacterial sirains and plasmids. Genomic DNA was prepared from the clos- 
tridial strains listed in Table 1 according to the method described by Lawson et 
al. (23). Escherichia coli One Shot obtained from Invitrogen (R&D Systems, 
Cowley, United Kingdom) was used as the recipient for cloning. 

PCR and cloning. PCR was carried out with Perkin-Elmer Ampli-7fl<? DNA 
polymerase (Applied Biosystems, Warrington, United Kingdom) as described 
previously (6) according to the scheme outlined in Fig. l. The sequences of 
primers used are given in Table 2. PCR products were cloned with a TA cloning 
kit (Invitrogen) as described previously (6). 

Hybridization analysis. Hybridizations were carried out with an fcLL kit 
(Amersham International, Amersham, United Kingdom) according to the man- 
ufacturer's instructions. Slot blots were performed essentially as described by 
Campbell et al. (2), with hybridization carried out at 42°C and washing carried 
out at SOX The oligonucleotide hybridization probes used are listed tn Table 2. 

DNA sequencing. Double-stranded DNA was sequenced as described previ- 
ously (2) with a U:S. Biochemicals Sequenase kit (Amersham) according to the 
manufacturer's instructions. Clones obtained from two separate PCRs were 
sequenced, and, in the case of discrepancy, a third independent clone was 
analyzed. . 

Analysis of data. Analysis was performed with the molecular biology software 
of University of Wisconsin Genetics Compute^ Group package (5). The neigh- 
bor-joining method described by Saitou and Nei (27) was used for construction 
of the phylogenetic trees. The stability of trees was assessed by bootstrap analysis 
(11). Bootstrap values were calculated from 250 replicates. 

Nucleotide sequence accession numbers. The sequences presented here have 
been given EMBUGenBank accession numbers as follows: NTNH gene se- 
quences from strains 62A, Kyoto-F, and NCTC 7273, respectively, X92973 
X87974, and X78230; P-21-NTNH gene sequences from strains Kyoto-F and 
Langeland, respectively, X96493 and X96494; P-21 partial gene sequences from 
strains Chiba-H and NCTC 9837, respectively, X96491 and X96492. 

RESULTS 

Analysis of the NTNH genes. The regions upstream of the 
BoNT genes in type A strains 62A and Kyoto-F, type B strain 
NCTC 7273, and type F strain Langeland were amplified by 
PCR, and fragments were cloned as outlined in Fig. 1. The 
gene' encoding NTNH was found immediately upstream of 
BoNT, and its sequences in types A and B were determined. In 
type A strain 62A, the C-terminal 103 amino acids correspond 
to translation of the region upstream of BoNT/A reported by 
Binz et al. (1), and for type B strain NCTC 7273, the C- 
terminal 8 amino acids correspond to translation of the region 
upstream of BoNT/B (34). For type A strain Kyoto-F, the 



TABLE 1. Details of strains used and results of PCR 
and probing experiments 



Strain" 



PCR with Probe PCR with PCR with PCR with 
H1-H2 HA33/A P1-P2 I+-NN2 I-NN2 



C botulinum group I 
NCTC 887 (Al) 
599 (Al) 
603 (Al) 
2775 (Al) 
62A (Al) 
547 (Al) 
726 (Al) 
1690 (Al) 
NCTC 7272 (Al) 
NCTC 2916 [A1(B)] 
NCTC 11199 
[A1(B)] 
2119 [A1(B)] 
5311 [A1(B)] 
7826 [A1(B)] 
MDalO [A1(B)] 
588 [A1(B)] 
13280 [A1(B)] 
667 [A1(B)1 
657 (Al/B) 
3281(Bf) 
Kyoto-F (A2) 
Chiba-H (A2) 
NCTC 9837 (A2) 
Y-8036 (A2) 
7105-H (A2) 
7103-H (A2) 
KZ1828 (A2) 
NCTC 2012 (A2) 
Langeland (F) 

C. botulinum group II 
Eklund 2B (B) 
Scott 2129B (B) 
Colworth 151 (B) 
Hobbs FT50 (B) 
Hazen 36208 (E) 
Eklund 202F (F) 
VPI 2093-1 (NT) 

C. botulinum group IV 
ATCC 27322 (G) 

C. barati 
ATCC 43756 (F) 

C. butyricum 
ATCC 43755 (E) 



+ 


+ 


+ 


+ 


- 


+ 


+ 


+ 


+ 


- 


+ 


+ 


+ 


+ 


- 


+ 


+ 


+ * 


+ 


- 


ND* 






+ 


- 


+ 


+ 


+ 


+ 


- 


+ 


+ 


+ 


+ 


- 


+ 


+ 


+ 


ND 


ND 


ND 




+ 


ND 


ND 




+ 


+ 


+ 


+ 




+ 


+ 


+ 


+ 






+ 


+ 


+ 






+ 


+ 


+ 






+ 


+ 


+ 


+ 


+ 




+ 


+ 




+ 


+ ' 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


ND 


ND 


ND 


+ 


+ 


ND 


+ 


+ 


ND 




+ 


_ 


+ 










+ 






+ 




+ 










+ 










+ 










+ 










+ 










+ 






+ 




+ 


+ 




+ 


+ 




+ 




+ 


+ 




+ 




+ 


+ 




+ 




+ 


+ 












+ 



"Toxin types are given in parentheses after strain designations. A1(B), pro- 
duction of BoNT/A but with silent BoNT/B present on the genome; Al/B, 
BoNT/A and BoNT/B (dual toxin) production; Bf, BoNT/B and BoNT/F pro- 
duction, but with BoNT/F produced only in minor amounts; NT, nontoxic. 

b ND, not determined. 



C-terminal 16 amino acids of NTNH correspond to the se- 
quence upstream of BoNT/A reported by Willems et al. (35). 
The proteins encoded by strain 62A (type Al), Kyoto-F (type 
A2), and NCTC 7273 (type B) sequences consisted of 1,193, 
1,159 and 1,197 amino acids, respectively, and showed homol- 
ogy to published NTNH sequences of toxins of other types (6, 
13, 14, 33). While this work was in progress, Fujita et al. (16) 
reported the NTNH sequence of type A strain A-NIH, which 
is identical to that reported here for type A strain 62A. The 
overall homology shown between these NTNHs (>60% iden- 
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0) Type A 

A1: strain 62A 



Bont-*. PCRprlment Plasmids 



A2: strain Kyoto-F 
300 is 



HAIJ-1/NTNH-1 pAET92S3 
NN1/NK2 pAET84/85 
NN3/BT3 pAETW/96 



P2/NN5 pMBT32/33 
NN4/NN2 pAET 69/89 

NN3/BT3 pAETB1/B2 



(ii)TypeB 

strain NCTC 7273 



[ ' - 



(Hi) Type F 

strain Langeland 



NN7/NN2 
NN3/BT4 



P2/NN5 
P2/NN6 
NN1/NN2 



pKCTIC 
pKCT3/4 



pM8T32/33 

pMBT35 

PAET79/80 



FIG. 1. Strategy for PCR amplification and cloning of fragments encoding 
components of the BoNT complex from proteolytic C botulinum type A, B, and 
F strains. The strains used and the approximate positions of binding of PCR 
primers are shown. Primer sequences are given in Table 2. The positions of genes 
encoding the components of the complex and direction of transcription are 
shown. The sizes of the intergenic regions are indicated. Dotted lines, sequences 
previously reported (type A strains 62A [1] and Kyoto-F [35], type B strains 
NCTC 7273 [8] and Danish [34], and type F strain Langeland [10]). 



tity) is higher than that shown between the BoNTs, which can 
be as low as ca. 34% identity (3, 10). From an alignment of the 
NTNH amino acid sequences (data not shown), it is clear that 
although the proteins show a high degree of homology, several 
insertions and/or deletions of one or more amino acids are 



evident. The largest and most striking of these is at position 
115, where there are an extra 33 amino acids in types Al and 
B compared with A2. The extra 33 residues are present in the 
NTNH sequence of C botulinum type C (33) but are absent 
from those of C. botulinum type E (14), C butyricum (13), and 
nonproteolytic C. botulinum type F (6). Preliminary sequence 
analysis of the region in proteolytic C botulinum type F strain 
Langeland shows that it too lacks these 33 amino acids (9). 
Since all of the strains examined which possess this insert 
(types Al } B, and C) have HA components in their progenitor 
toxin complex, it is interesting to speculate that this part of the 
NTNH may somehow be involved in interaction with HAs. It 
has recently been shown that the NTNH component of type A, 
C, and D strains of the M complex has been proteolytically 
cleaved at a position within the extra 33 amino acids, while the 
NTNH component of the L complex is unprocessed (16). Since 
the L complex contains components which have HA activity, 
while the M complex lacks them, processing within this region 
may prevent (directly or indirectly) the HA components from 
interacting to form the larger complex. 

To examine the occurrence of this insert in the NTNH genes 
of other strains, PCR primer I+, which binds to a region within 
the extra 33 residues (positions 122 to 127 [Table 2]), and 
primer I-, which spans the site at which the additional resi- 
dues are present in some strains (residues 110 to 117 for 
NTNH/A2 [Table 2]), were used in combination with con- 
served primer NN2 (positions 692 to 699 [Table 2]). PCR 
products were obtained with primer combination I+-NN2 with 
all strains encoding type Al [including A1(B)] and B tested but 
not with any of those encoding type A2 (Table 1). With primer 
combination I--NN2, positive results were obtained for all 
strains encoding types A2, E, F, and, unexpectedly, all type A 
strains containing silent BoNT/B sequences [i.e., Al(B)]. Since 
products were obtained with both pairs of primers for type 
A1(B) strains, this indicates that NTNH genes encoding pro- 
teins with and without the additional 33 amino acids are 



TABLE 2. Oligonucleotide primers used in PCR and probing experiments 



PCR primer 
or probe 



PCR primers 
BT3 
BT4 
HI 
H2 

HAII-1 

1+ 

I- 

NN1 

NN2 

NN3 

NN4 

NN5 

NN6 

NN7 

NTNH-1 

PI 

P2 

Probes 
H33/A 
H33/Bp 
A33/Bnp 



Sequence 5'-3' 



Target amino acids" 



TGCTTCrGGTGGTGGATTTAAATCTCC 

CTCCCCGTACCTCTCGCAAATGG 

AAATTGTTACC ATCTCCTGTAAG G C 

AATTAAATACTTGAATAGCAGTTCCGT 

AT(A/r)TATAATTACCATTAGG 

ATATGATTACITTTGGATCAGCAC 

CTTATGAATATAAACCTGGAGATTAT 

ATA(A/G)ATTC(A/T)CCAGTGGA(T/C)AA 

ATTTTCTTGTGCTAATATTGA 

TTAGTTTCTTAGATCAATGGTGG 

AGAAAGGTATTATGGAGACA 

TTAGAATCATAAATTCC(T/A)CC(G/A)TC 

TGATTATCTTTAGATACT 

TAAGGGGGCGTGTATAGTA 

CTTG CTCTAACT ATT AC AAC 

GATAT(A/r)(C/G/T)TG(AyT)(A/G)(T/C)CAT(C/T)T(A/T)TGG 
T(T/C)TTATATA(C/T)AG(A/C)(T/C)TGACGACT 



CTTCTAATGATCAAAATA 
TCCCGGTAACGGTAACGT 
GCTATGACAGATCTAAG 



Target positions* 



GDLNPPPEA 


57-65 (BoNT/A) 


PFARGTG 


25-31 (BoNT/B) 


1VTISCKA 


16-23 (HA-33) 


GTAJQVFN 


275-282 (HA-33) 


PNGNYNI 


9-15 (HA-17) 


MITFGSA 


122-127 (NTNH/A1) 


YEYKPGDY 


110-117 (NTNH/A2) 


I(N/D)SPVDN 


9-15 (NTNH) 


SILAQEND 


692-699 (NTNH) 


SFLDQWW 


669-675 (NTNH) 


ERYYGE 


44-49 (NTNH) 


DGG1YDS 


60-66 (NTNH) 


VSKDN 


157-161 (NTNH) 


Noncoding 


Upstream of NTNH 


WIVRAR 


18-24 (NTNH) 


DI(UV)xH(F/L)W 


44-50 (P-21) 


SRQ(S/A)(V/I)YK 


167-173 (P-21) 


SNDQN 


231-235 (HA-33) 


PGNGNV 


33-38 (HA-33) 


AMTDLS 


174-179 (HA-33) 



° Amino acids for oligonucleotide target positions, x, any amino acid. 

b Binding positions are given with reference to the following: BoNT/A (1), BoNT/B (34), HA-33 (8), HA-17 (19), NTNH (6), and P-21 (Fig. 6). 
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FIG. 2. Pairwise comparison of NTNH sequences from proteolytic C. botulinum type A strains 62A (Al) and Kyoto-F (A2), type B strain NCTC 7273 (Bp), and 
type A(B) strain 667. 667' A', A cluster; 667'B', B cluster (21). Comparisons were performed on blocks of 100 amino acids (shown at the top). Shading indicates the 
degrees of homology. 




present. Hutson et al. (21) have sequenced the two BoNT gene 
clusters encoded by C. botulinum type A1(B) strain 667, which 
revealed that the NTNH gene of the A cluster (i.e., upstream 
of the expressed BoNT/A) lacks the extra 33 amino acids while 
the NTNH encoded by the B cluster possesses them. 

Pairwise analyses were performed on the NTNH gene se- 
quences of proteolytic C, botulinum. It was evident from these 
analyses that for some pairs of sequences there was consider- 
able variation in the degree of homology along their lengths. 
The variability in the pattern of relationship between different 
positions in the NTNH sequences of representative combina- 
tions of C. botulinum strains is illustrated in Fig. 2. For exam- 
ple, the NTNHs of proteolytic C. botulinum types A and B 
show very high sequence relatedness for approximately the first 
600 residues (>99% identity). In contrast, the amino acids 
from position ca. 600 to the C terminus have a much-reduced 
relatedness, ca. 66% identity. The reverse is observed for the 
NTNHs of C. botulinum types Al and A2, with the C-terminal 
end showing high sequence identity (ca. 93% for position 670 
to the end) compared with that for the N-terminal region (ca. 
67% for positions 1 to 669). Matrices of pairwise sequence 
homology data from different regions of the NTNH (residues 
1 to 550, 551 to 1020, and 1021 to tha C terminus) were used 
to construct comparative genealogical trees (Fig. 3). Major 
differences in the three trees reinforce the incongruence in the 
pattern of relationships along the length of the NTNHs. Such 
marked discordance could be symptomatic of recombination 
and suggests that the NTNH gene is a hot spot for such events 
within the BoNT complex cluster of genes. Hutson et al. (21) 
reported that the NTNH gene adjacent to the silent BoNT/B 
gene (B cluster) in C. botulinum type A(B) strain 667 was 
chimeric, with the 5' and 3' regions of the gene exhibiting high 
homology with the corresponding regions of the type B NTNH 
gene (Fig. 3), while a 417-amino-acid sequence in the central 
region was identical to that in the NTNH whose gene was 
located adjacent to the expressed BoNT/A gene (A cluster) 
(Fig. 2). The results of the present study demonstrate that such 
chimera-like NTNH genes may be commonplace among pro- 
teolytic C. botulinum. 

A comparison of the homology values along the length of the 
cluster of genes encoding the BoNT complex for type A strains 
62A and Kyoto-F and type B strain NCTC 7273 is shown in 
Fig. 4. In addition to the variation in the degree of homology 
in the coding regions, further weight is given to the idea of 



recombination by observation of the intergenic regions. In 
these strains, the intergenic regions are of identical size and 
(almost) identical sequence between HA-17 and HA-33, 
HA-33 and P-21, and P-21 and NTNH in type A strain 62A and 
type B strain NCTC 7273, while the corresponding region in 
type A strain Kyoto-F is noncomparable (Fig. 4). In contrast, 
the intergenic region between the NTNH and BoNT genes is 
identical in size and sequence in the two type A strains, while 
that of type B strain NCTC 7273 is different (Fig. 4). 

Investigation of the HA-33 genes of types A and B. We 
recently showed that type A strain NCTC 7272 and type B 
strains NCTC 7273 and 17B encode genes for HA-17, HA-33, 
and P-21 (linked in this order) upstream of those for NTNH 
and BoNT (Fig. 1) (8). The region encoding HA-33 and P-21 
was cloned from type A strain 62A (Fig. 1) and was found to be 
identical to that from type A strain NCTC 7272 (8). To deter- 
mine if the HA-33 gene is present in other strains, PCR prim- 
ers based on conserved amino acids were developed (primers 
HI and H2 [Table 2]). Positive results (PCR product of ca. 1.2 
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FIG. 3. Trees revealing the .discordance in relationships along the lengths of 
NTNHs, residues 1 to 550 (ij, residues 551 to 1020 (ii), and residue 1021 to C 
terminus (iii). NTNH sequences from the following were compared: proteolytic 
C botulinum type A strains 62A (Al) and Kyoto-F (A2), type B strain NCTC 
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and nonproteolytic C. botulinum type F strain Eklund 202F (Fnp) (6). The 
numbers on the tree indicate bootstrap values for branch points; only values of 
>90 are shown. 
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kb) were obtained for C. botulinum strains encoding type Al 
[including A1(B)] and B but not for those encoding type A2, E, 
F, and G or toxigenic strains of C. barati and C. butyricum 
(Table 1). Although the lack of a PCR product may be due to 
the mismatch of primers with the target gene, these results 
correlate with the presence or absence of HA activity in the 
progenitor complexes; strains encoding types A and B, which 
produce complexes with HA activity, give positive results, 
while strains which lack HA activity (types A2, E, and F and 
strains of C. barati and C. butyricum) give no PCR product. 
Although type G strains produce complexes which contain 
components with HA activity, purification of the components 
shows that none correspond to HA-33 of type A, B, or C (25). 



Hybridization of the PCR products with probe H33/A (Fig. 
5A) shows a positive result for all type Al, A1(B), Al/B, and 
Bf strains. In contrast, the probe designed to detect HA-33 of 
proteolytic B (H33/Bp) gave a signal only with the positive 
control and not with any other strains, including the type 
A1(B) strains (Fig. 5C). The probe to detect HA-33 of non- 
proteolytic type B (H33/Bnp) hybridized only with the PCR 
product from the nonproteolytic B strains (Fig. 5B). Interest- 
ingly, all of the A1(B) strains examined hybridized only with 
probe H33/A, indicating the presence of only type A HA-33. 
Sequence analysis of C. botulinum type A1(B), strain 667, 
showed only one HA-33 gene to be present; although it was 
located in the B cluster, it showed the highest homology with 
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FIG. 6. Comparison of derived amino acid sequences for P-21s from C. botutinum type A strains 62A (Al), Kyoto-F (A2K), Chiba-H (A2C), and NCTC 9837 (A29); 
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identical and similar amino acids in five or more sequences, respectively; dots, absent residues. 



HA-33/A (ca. 99% identity) and possessed the hybridization 
site of the H33/A probe (21). The single Bf strain examined 
(strain 3281) hybridized only with the HA-33 type A probe 
(Fig. 5). Further experiments are under way to investigate this 
result. 

Elucidation of P-21 gene sequences in C. botulinum types A2 
and F. A gene encoding a potential regulator of the BoNT 
complex was reported for C. botulinum types A and B (desig- 
nated P-21) (8) and type C (designated orf-22) (19). This gene 
is clustered with those of the components of the BoNT com- 
plex, although it maps to different positions in strains of dif- 
ferent toxin types. PCR primers based on conserved regions of 
P-21 (PI and P2 [Table 2]) were designed and used to inves- 
tigate the presence of this gene in ether strains (Table 1). No 
PCR products were obtained for any of the strains of non- 
proteolytic C. botulinum type F, C. botulinum type E, C. barati, 
or C. butyricum examined. PCR amplification products with 
the expected size (ca. 400 bp) were obtained for all strains 
encoding types Al, A1(B), B, and G for the single proteolytic 
C botulinum type F strain and three of the type A2 strains 
examined (Table 1). The PCR amplification products from the 
proteolytic C. botulinum type F strain (Langeland) and the 
three type A2 strains (Kyoto-F, Chiba-H, and NCTC 9837) 
were cloned, and their sequences were determined. Very high 
sequence homology was observed between the P-21 proteins 
from these type A2 strains and type F strain Langeland (ca. 
91% sequence identity). The P-21 proteins from type A2 
strains and type F strain Langeland show ca. 63% amino acid 
identity with the P-21 proteins from C botulinum type Al and 
type B strains (proteolytic and nonproteolytic), ca. 55% se- 
quence identity with the orf-22 gene product from C. botulinum 
type C (19), and ca. 26% identity with the uviA gene product 
from Clostridium perfringens (17). UviA is thought to be a 
DNA-binding protein with a possible regulatory function (24). 
Figure 6 shows a comparison of the sequences of the P-21 



proteins of C. botulinum and UviA from C perfnngens. To 
determine the position of the P-21 gene in strains Langeland 
and Kyoto-F, PCR strategies were employed. The P-21 gene 
was found to map in an identical position ca. 2 kb upstream of 
the NTNH gene in both of these strains. The orientation of 
P-21 is such that it is transcribed away from the 5' end of the 
NTNH gene (Fig. 1). 

Analysis of a new open reading frame. In an earlier study, we 
reported the 3' end of an unknown orf located upstream of 
NTNH gene in nonproteolytic C. botulinum type F strain Ek- 
lund 202F (6). Cloning and sequencing the region between the 
NTNH and P-21 genes revealed an orf, which was preceded by 
a putative Shine-Dalgarno sequence, encoding a protein of 416 
amino acids in both proteolytic C. botulinum type F (Lange- 
land) and type A2 (Kyoto-F) strains. This protein has been 
designated P-47, since it has relative molecular weight values 
of 47,476 in strain Kyoto-F and of 47,420 in strain Langeland. 
A comparison of the derived amino acid sequences of P-47 
proteins for strains Kyoto-F and Langeland shows ca. 79% 
identity overall but ca. 96% identity in the first 200 amino 
acids. The translated 3' end of the orf in C. botulinum type F 
strain Eklund 202F shows homology to the C termini of P-47 
proteins encoded in strains Langeland and Kyoto-F (ca. 66 to 
69% identity over 271 amino acids [7]). Interestingly, the 3' 
end of this orf is also found in C. botulinum type A(B) strain 
667 (21), and translation of sequence upstream of the NTNH 
gene in C. botulinum type E (14) and C. butyricum (13) reveals 
an orf encoding a protein with homology to P-47s of Kyoto-F 
and Langeland (ca. 54 to 80% identity). This gene maps in 
identical positions in these strains (Fig. 1). The gene encoding 
P-47 is, therefore, widely distributed among strains encoding 
BoNT and is linked or clustered with genes encoding neuro- 
toxin complex components. It may be pertinent that this gene 
has been found only in toxin gene clusters which lack genes for 
HA components of the complex. 
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DISCUSSION 

It has long been recognized that strains of C botulinum are 
physiologically heterogeneous, and four distinct phenotypic 
groups (I to IV) are recognized (18). These four metabolically 
distinct groups do' not, however, necessarily correlate with the 
serological specificities of the BoNT produced (18). Genealog- 
ical analysis based on 16S rRNA gene sequences has shown 
that C. botulinum consists of four highly divergent lines which 
correlate with the four phenotypic divisions (22). Recent se- 
quencing studies of the genes encoding BoNT have unequiv- 
ocally demonstrated the marked discordance between trees 
depicting natural relationships (i.e., 16S rRNA) of the organ- 
isms and those of the various BoNT types produced (e.g., see 
references 3, 10, and 20). In particular, the results of these 
comparative sequencing studies are strongly indicative of 
BoNT gene transfer between C botulinum metabolic groups or 
rRNA lines (viz., BoNT/B in groups I and II, BoNT/E in group 
II and toxigenic C. butyricum, and BoNT/F in groups I and II 
and toxigenic C barati). As outlined earlier, the gene encoding 
BoNT is clustered with several other genes encoding compo- 
nents of the progenitor complex. Since BoNT is responsible for 
botulism and BoNT is the primary trait for defining and sub- 
dividing the so-called species C. botulinum, a thorough under- 
standing of the evolutionary interrelationships of the genes 
encoding this toxin and other components of the progenitor 
complex is essential. Knowledge of the organization of the 
cluster of genes encoding BoNT complex components is very 
limited. Information on the genealogy of the various progeni- 
tor proteins is also unknown, and since genes encoding these 
are proximal to that responsible for BoNT production, it is 
likely that their genealogical patterns would be similar to that 
of the neurotoxins. In this study, we have examined the ar- 
rangements of genes encoding components of the BoNT com- 
plex in proteolytic (group I) C. botulinum type A, B, and F 
strains. Two different arrangements were found. Type A strain 
62A and type B strain NCTC 7273 possessed similar organi- 
zations, viz., BoNT, NTNH, P-21, and HA components; type F 
strain Langeland and infant type A strain Kyoto-F exhibited 
gene organizations similar to each other, viz., BoNT, NTNH, 
P-47, and P-21, and appeared to lack HA genes. In addition to 
this discordance in gene organization versus BoNT type, we 
have shown that the genealogy of the NTNH (the gene encod- 
ing NTNH is linked to the BoNT gene) does not mirror the 
evolutionary history of BoNT. Furthermore, we have found 
evidence of chimeric NTNH gene sequences, which could be 
symptomatic of recombination, and indicate that the NTNH 
gene may be a hot spot for such events within the BoNT 
complex gene cluster. There is clearly a strong case to extend 
the described studies to embrace further group I strains and C. 
botulinum groups (including other BoNT-producing species) to 
obtain a better understanding of population structure within 
this important pathogen and a complete picture of the evolu- 
tionary histories of its neurotoxin and associated proteins. 
Such genetic information is of great relevance to the taxonomy 
of C. botulinum, which is currently defined and subdivided on 
the basis of the type of neurotoxin produced. 
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Chromosomal DNA was extracted from toxigenic Clostridium butyricum strain BL6340 isolated from a case of 
infant botulism. After digestion by EcoRl, a DNA fragment of about 1 kbp was cloned into Escherichia coli using 
Agtll, and was subcloned into pUC118. The K coli cells transformed with this cloned fragment produced a 33 kDa 
protein which reacted with monoclonal antibodies recognizing the light chain (Lc) component of botulinum type E 
toxin. The nucleotide sequence of the cloned fragment was determined. The sequence was similar to that from 
botulinum type E toxin gene fragments previously determined by our laboratory (strains Mashike, Otaru and 
Iwanai). Several highly homologous sequences among the botulinum type A, C, E, butyricum and tetanus toxin 
genes were found in both translated and untranslated regions. These results suggest that the toxin gene of C 
butyricum may have evolved by transfer from C botulinum. 



Introduction 

Clostridium botulinum is the causative agent of food- 
borne botulism or infant botulism. Neurotoxins pro- 
duced by the bacteria are classified into seven groups (A 
to G) based on their antigenicity. Recently, however, 
Clostridium species other than C. botulinum have been 
found to produce botulinum neurotoxin. C. butyricum 
and C barati produce type E and type F neurotoxins, 
respectively, which are implicated in the onset of human 
infant botulism (Hall et al, 1985; McCroskey et al, 
1986). The DNA extracted from a type E toxin- 
producing C. butyricum strain showed high homology to 
that obtained from a typical nontoxigenic C. butyricum 
strain (Suen et al, 1988). It is therefore important to 
establish whether toxin genes exist in toxigenic and 
nontoxigenic C. butyricum strains, and to clarify the 
mechanism of transfer of toxin genes from C botulinum 
to other Clostridia. The N-terminal amino acid sequences 
of botulinum type A, B, C, D and E toxins, and of the 
toxin produced by toxigenic C. butyricum, have been 
determined (Gimenez et al, 1988; Moriishi et al, 1989; 

Abbreviation: mAb, monoclonal antibody. 

The sequence data in this paper have been submitted to EMBL and 
have been assigned the accession number X53180. 



Sathyamoorthy et al, 1985; Tsuzuki etal, 1988). The 5'- 
terminus and complete nucleotide sequence of type A 
toxin gene has been reported (Betlay et al, 1989; Binz et 
al, 1990; Thompson et al, 1990). Recently, the complete 
nucleotide sequence of the structural gene of botulinum 
type C toxin and the 5'-terminal nucleotide sequence of 
the type E toxin gene (strains Mashike, Otaru, Iwanai) 
have been determined in our laboratory (Fujii et al, 
1990; Kimura et al, 1990). The nucleotide sequences 
obtained from three type E toxin genes were identical. In 
this paper, we report the cloning, sequencing and 
expression of a toxin gene fragment from a toxigenic C. 
butyricum strain, and compare the 5'-terminal nucleotide 
sequence (and the deduced amino acid sequence) to those 
of botulinunUoxins type A, C and E, and tetanus toxin. 



Methods 

Extraction of DNA from cells. One toxigenic strain of C. butyricum 
(BL6340, isolated from a case of infant botulism and kindly provided 
by Dr Hatheway, Centers for Disease Control, Atlanta, USA) and nine 
nontoxigenic strains (KZ145 and KZ589, provided by Dr Hatheway; 
IFO14810, IFO20064, Y-M-la, Y-M-2a, Y-M-3a, Y-B-la and 
IAM19001, provided by Dr Arai, Biotherapy Research Institute, 
Tokyo, Japan) were used. These strains were cultured in 250 ml LYG 
medium (1% (w/v) lactalbumin hydrolysate, 2% (w/v) yeast extract, 
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#5J£.(w/v) 'giiicose, 4 0-,15%l(w/v) cysteine hydrochloride,' pH 7-2) ; at 
30 °C for 1 2 -hViTlije cellV were collected by centrifugation at , 6000 r.plm.'.,: . 
(II 000*) for^^^t^mAemperaiiire; ari^resujspehded in 20 nil *• ' 
LYG mediuni : Stainihg ; 8% (w/v) polyethylene glycol 4000, 10 U 
penicillin G ml" 1 and 10 ug lysozyme ml" 1 , and then incubated at 30 °C 
for 60-90 min. After centrifugation, the cells were lysed using 2-3% 
(w/v) of sodium dodecyl sulphate (SDS). Extraction and purification of 
chromosomal DN A were performed by the procedures described by 
Marmur (Marmur, 1961). 

Monoclonal antibodies. Three monoclonal antibodies (mAbs) against 
light chain (Lc) component of type E toxin, EL161-38, EL211-3, 
EL219-15, were those prepared previously (Tsuzuki et a/., 1988). 
EL21 1-3 and EL219-15 react only with type E toxin, but EL161-38 also 
reacts with Lc components of botulinum type B, C, D and tetanus 
toxins. 

Cloning and sub-cloning of the toxin gene. Cellular DNA from 
toxigenic strain BL6340 was digested by EcoRl (Takara Shuzo Co., 
Kyoto, Japan), ligated to Agtll £coRI arms (Promega Corporation), 
and then packed in vitro by using Packagene (Promega Corporation). 
The gene library was plated on E. coli Y 1090 (Promega Corporation), 
and positive clones were immunoscreened using mAb EL 16 1-38 and 
the ProtoBlot Immunoscreening System (Promega Corporation). The 
isolated positive clones were purified, and the recombinant phage 
DNA was extracted. After digestion with EcoRl, the inserts separated 
from phage DNA were then sub-cloned into the EcoRl site on the 
polylinker of pUCl 18 (Takara Shuzo Co.). 

DNA sequencing and Western blot analysis. DNA sequence analysis 
was performed by the dideoxy termination method using [ot- 35 S]dATP 
(NEN Products) and a T7 DNA sequencing kit (Pharmacia). 

Gene products from E. coli strain Y1090 infected with recombinant 
phage, and from E. coli strain MV1 184 transformed by recombinant 
plasmids, were analysed by Western blots with mAbs and with 
alkaline-phosphatase-labelled anti-mouse immunoglobulin G conju- 
gate (Promega Corporation), nitroblue tetrazolium and 5-bromo-4- 
chloro-3-indolyl phosphate. The transformed cells were incubated in 
10 ml 2xTY medium [1-6% (w/v) tryptone, 1% (w/v) yeast extract, 
0-5% (w/v) NaCl, pH 7-6] at 37 °C for 12 h. After centrifugation, the 
cells" were suspended in 1 ml TEP buffer (100 mM-Tris/HCl, pH 7*4, 
10 mM-EDTA, 1 mM-PMSF), sonicated for about 2min at 0°C, and 
centrifuged at 12000 r.p.m. for lOmin at 4°C. The resulting 
supernatants were electrophoresed on 12% (w/v) SDS-polyacrylamide 
gel, and Western blot analysis was carried out as described previously 
(Tsuzuki et aL, 1988). 

Dot blot hybridization. DNA samples (2, 20 and 40 ug) from toxigenic 
and nontoxigenic strains of C. butyricum were spotted on a nylon 
membrane (Pall Ultrafine Filtration Corp., Glen Cove, NY, USA). 
After denaturation and neutralization, the membrane was prehybri- 
dized in 50% (v/v) formamide hybridization buffer at 42 °C for 12 h, 
and then hybridized with 35 S-labelled toxin probe using a DNA 
labelling kit (Nippon Gene, Toyama, Japan) at 42 °C for 24 h 
according to the procedures described previously (Fujii et al., 1988). 
The membrane was washed at 68 °C and autoradiographed. 



Results and Discussion 

Cloning and gene products 

Chromosomal DNA isolated from the toxigenic C. 
butyricum strain BL 6340 was digested with EcoRl and 
ligated to Agtll EcoRl arms. After in vitro packaging, 
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Fig. 1. Western blot analysis of gene products by recombinant phage 
and plasmid. Cellular extracts from E. coli Y1090 infected with (1) 
Agtl 1 containing a 1 kbp EcoRl fragment of the butyricum toxin gene 
(ABU-LI); (2) Agtll containing a type E toxin gene EcoRl fragment 
(AEM-L1); (3) control Agtll ; and from E. coli MV1 184 transformed by 
pUC 1 1 8 containing butyricum toxin gene EcoRl fragment (pBU-Ll) (4 
and 6) or botulinum type E toxin gene EcoRl fragment (pEM-Ll) (5 
and 7). The extracts were characterized by Western blots using mAb 
ELI 61-38 against botulinum type E toxin light chain. The cells of lanes 
1-5, and lanes 6-7 were incubated in the presence and absence of 
IPTG, respectively. 



and infection into E. coli, the recombinant plaques were 
screened for the production of toxin fragments using 
mAb EL161-38. Several positive clones were detected 
and one, ABU-LI, was chosen for further study. ABU-LI 
DNA was prepared, and found to contain a single 1 kbp 
EcoRl insert. E. coli Y1090 cells infected with phage 
ABU-LI produced a 33 kDa protein which reacted with 
mAb EL161-38 both in the presence and in the absence 
of IPTG (Fig. 1). In addition, the infected cells reacted 
with mAb EL211-3, but not with mAb EL219-45 (data 
not shown). The 10 kbp EcoRl fragment was isolated 
from ABU-LI, and then recloned into the EcoRl site of 
pUC118. E. coli strain MV1184, when transformed by 
the recombinant plasmid (pBU-Ll) produced a protein 
of similar size (34 kDa) as that obtained from the phage 
clone (Fig. 1). These gene products migrated slightly 
faster than those produced by E. coli cells that had been 
transformed with phage or plasmid expressing an 
analogous fragment (about 1 kbp) which encodes the 
botulinum type E toxin gene (Fujii et aL, 1990). 

Susceptibility of insert to endonuclease 

The 10 kbp EcoRl fragment from pBU-Ll was digested 
with several restriction enzymes, and the restriction map 
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100 bp 

Fig. 2. Restriction endonuclease recognition sites in the fragments 
from C. butyricum (a) and C. botulinum type E The map was 
determined by 4% NuSieve agarose gel electrophoresis of the plasmid 
DNAs cleaved by EcoRI (E), Drai (D) and //mcll (H). 



was found to be similar, but not identical, to that of the 
analogous 1 -0 kbp fragment from the C. botulinum type E 
toxin gene (Fig. 2). 



Nucleotide and derived amino acid sequences of 
butyricum toxin gene EcoRI fragment 

The 5'-terminal nucleotide sequence (756 bp) of the 
butyricum toxin gene and 227 bp of the 5'-untranslated 
region was determined. The number of nucleotides of the 
coding and noncoding regions was the same as that of the 
cloned type E toxin gene fragment (Fujii et ai, 1990). 
However, the nucleotide sequences of the butyricum and 
botulinum type E toxin genes were not identical; eight 
bases in the untranslated region, and 27 bases in the open 
reading frame were different (Fig. 3). A Shine-Dalgarno 
consensus sequence (AGGAGA) was found 14 nucleo- 
tides upstream from the translation initiation start codon 
(ATG). A base sequence corresponding to the -10 
consensus region of E. coli (TATAAT) was identified at 
position 87, but no sequence corresponding to the -35 
consensus sequence of E. coli (TTGACA) was detected. 
Two stem-loop structures, at bases 17 to 48 and 121 to 
155, were also detected in the region upstream of the 
- 10 region and the Shine-Dalgarno sequence, respec- 
tively. The deduced N-terminal amino acid sequence of 
butyricum toxin was identical to that previously reported 
by protein analysis of the toxin (Gimenez et aL, 1988). It 
was also similar to the protein sequence of the botulinum 
type E toxin (strains Mashike, Otaru and Iwanai) except 
for 17 amino acid residues, which were altered by the 
nucleotide changes described above (Fig. 4). 



. '■ Botulinum toxin gene in butyricum 

;&m]mmpr§^ acid sequences of the 

putafiMt^^^h^^h; i^ose of botulinum Jype A, C and 
tetanus toxins±y, t ; ■ . . * 

As described above, the nucleotide sequences of butyri- 
cum and type E toxin gene fragments were similar in 
both the noncoding and coding regions. Since the 
nucleotide sequences of types A, C and E, and tetanus 
toxin gene have been determined, the sequences of these 
five toxin genes were compared. There exist several 
regions of homologous sequence in both untranslated 
and translated regions (Fig. 3). In the untranslated 
region, it was very easy to find sequences corresponding 
to a Shine-Dalgarno sequence which is presumably 
involved in the initiation of translation of the toxin 
genes. In contrast, the sequence corresponding to the 
- 10 and - 35 regions was difficult to identify, due to the 
high A-T content of the DNA. However, one potential 
-10 region was identified 121 bp upstream from a 
Shine-Dalgarno region. 

In addition to these probable -10 and Shine- 
Dalgarno regions, we identified highly homologous 
sequences of ATGATAAAT and TATAAAAAATCA 
at about 2 and 21 bp downstream of the probable - 10 
region of the C. butyricum, C. botulinum type E and 
tetanus toxin genes, and the sequence ATATTAAA was 
found between the probable promoter and Shine- 
Dalgarno regions in all of the C. butyricum, type A, type 
C, type E and tetanus toxin genes. We propose th^t these 
homologous sequences may play a role in regulation of 
expression of the toxin genes. 

It has been reported that tetanus toxin and the 
different types of botulinum toxins possess several 
consensus amino acid sequences of between 7 and 20 
residues in length (Tsuzuki et at., 1988). In addition to 
these sequences, we found three other highly homologous 
regions in botulinum type A, C, E, butyricum and 
tetanus toxins (Fig. 4). The nucleotide sequences 
corresponding to these three homologous regions were 
also quite similar to each other (Fig. 3). One region 
(amino acid residues 65 to 85 in butyricum toxin) was 
hydrophilic, and another, the region 201-223, was 
hydrophobic. This hydrophobic region might conceiv- 
ably play a role in spanning the mammalian cell 
membrane, as suggested for tetanus toxin by Eisel et al. 
(1986). 

Detection of toxin gene in toxigenic and nbntoxigenic 
strains of C. butyricum 

DNA samples (2, 20, and 40 ug) from one toxigenic and 
nine nontoxigenic strains of C. butyricum were spotted on 
a nylon membrane, and the membrane was hybridized 
with the 1 kbp EcoRI probe from the C. butyricum toxin 




522 N. Fujii and others 



Butyricum (BL6340) 
Type E (Mashike) 

Type A (62A) 
Type C (c-si) 
Tetanus 



CAATTCAATT AGTAG ATAAC A A A A A TA A T C CAA ACACTTT TATTATT AAT AATCATATAT TTATTTCTAA 
c A_- T 7l?C---T C 

ACTAATACAT AACAAAAATA ACCCAAACAA GATCATA ATT ACTAATCATA TATTTATTTC CAATTCTTTA 

TCCTCA AAAT AATACAATAC A A TTAGTA AG TTCCAAACAT AATGCAAAAA AGATTACAGT TAATACTGAT 

TTATAATTTA ATTATGA ATA ATATTCTTAA GATAAAAACT AAATTTTTAA AAATTTAAAT TTTCACTTTA 



Butyricum 
Type E 

Type A 
Type C 
Tetanus 



CTGTTTAACT TTAA CT TATA AT A ATGTAAA TGTA TATTTA TC TATAAAAA ATCA AGATTA CAATTCCCTT 
^ — —5.3 



ACCCTATCTT ATAACGGTAA ATATA TATGT T TATCTATCA A AGATCAA AA CCATAATTCC ATCATATCTA 
TTATTTAGAC CTCATTG TAT A AC ATTTTCA TATA A TGATA A ATA TTTTTC TCTATCACTT AGACATGGAG 
CAAA AAATAA CCTGAT TATG TT AT ATGTAA TTGTA A AAAA CA TATA AA A A ATCA GAAAAA TTTACGAGGT 



Butyricum 
Type E 

Type A 
Type C 
Tetanus 

Butyricum 
Type E 

Type A 
Type C 
Tetanus 

Butyricum 
Type E 

Type A 
Type C 
Tetanus 

Butyricum 
Type E 

Type A 
Type C 
Tetanus 

Butyricum 
Type E 

Type A 
Type C 
Tetanus 



ATATGTGATC TTAATCATC A TATACCCAA A AACTCATATC TATGG ATATT A AA AAATATA TAA ATTTAA A. 
A 

ATA'ATCATAT GTCA A AGTAT TTCTATTTAT CCTCATTTAA ATAATTAATA ATTTAATTAA TTTTAAATAT 

ATTATAATTG CATCATATCT AATGACAATA ACAACGTCCT AAACCTCCAC ATTTGTCCAT ATTAGA AAGT 

ATATTATTA A TGGATTA AAT A ATA ATTTTT. TAATTTACTT TTGATTAATA A ATATTA A A T CTTTATTTTA 
OR F 

t gctgtatIat g CCAACAATTA ATAGTTTTAA TTATAATGAT CCTGTTAATA ATAGAACAAT 

i c 



ATTAGGAGA 



TAJT^ AGACCjT CTTAAATATC_CCATTT CTTA ATA A ACA ATT TAATTATAAA GATCCTCTA A ATGGTGTTGA 

^GAGA^CT TAGTATTATC_CCAATA ACAA TTAACAACTT TAATTATTCA GATCCTGTTC ATAATAAAAA 

at^acgacaIt GATACCTATC CCAATA ACCA TAAATAATTT TAGATATAGT CATCCTGTTA ATAATGATAC 
SD 

TTTATATATT AA ACCAGGCG GTTGTCA ACA ATTTTATA A A TCATTTAATA TTATGA A A A A TATTTGGATA 
__ -G- - - 

TATTCCTTAT ATAAAAATTC CAAATCCACC ACAAATCCAA CCAGTAAA AG CTTTTA AAAT TCATAATAA£ 

TATTTTATAT TTAGATACTC ATTTAAATAC ACTACCTAAT GAGCCTCAA A AACCCTTTCG CATTACAGGA 

AATTATTATC ATCCAGCCAC CATACTGTAA CCGTCTAGAT ATCTATTATA AGCCTTTCAA AA.TA ACAGAT 

ATTCCAGACA GAAATGTAAT TGGTACA ATT CCCCA AGATT TTCTTCCGCC TACTTCATTG AAA AATCGAG 
CC A A 

ATATGGGTTA TTCCAGA A AG AC ATACATTT ACAA ATCCTG AACAAGCAGA TTTA A ATCCA CCACCACAAC 

AATATATCCC TAA TA CCTGA TA GATTTTCA ACAAATTCTA ATCCA A ATTT AAATAAACCT CCTCCACTTA 
CGTA 



TTTGGA TAGTGCCCGA AAGGTATGAA TTTGGGACAA AACCTGAACA TTTTAACCCA CCATCTTCAT 



ATaTtACTTA *TTATGACCCT AATTATTTAC AAAGTGATCA AGAAAAGGAT AAATTTTTAA A A ATAGTCAC 



CAAAACAACT TCCAGTTTCA TATTATGATT CA A CATATTT AAGTAC AGAT AATGA A AA AG ATA ATTATTT 
CAAGCCCTAA A AGTCCTTAT TATGATCCTA ATTATTTGAG TACTGATTCT GACA A AC A TA CATTTTTA A A 
TAATAGAACG TGCATCTCAC TATTACCATC CA AATTATTT A AGGACTGAT TCTCATAA AG ATAGATTTTT 



Butyricum 
Type E 



AAA AATATTT AATACAATAA ATGATAATCT TTCAGCAACC ATTTTATTAG AACAACTCTC AAAAGCTAAT 
A — - G- - - 



Botulinum toxin gene in C butyricum 



Type A 
TypeC 
Tetanus 



AAAGGCACTT ACAA A ATTAT TTGACACAAT TTATTCA ACT CATCTTCCAA CAATCTTCTT AACATCAATA 
ACAAATTATA AAGTTATTTA A AAGA ATTA A TTCTACAGA A ATACCACAAG AATTAATATA TACACTTTCC 
ACAAACCATC CTAAAACTCT TTAACACAAT TAAAAACAAT CTAGCACCTC AACCCTTATT acataacata 



Butyricum ccatatttac gaaatc'ataa TACTCCAGAT CGTCACTTCA TTATTAATCA TCCATCAGCA GTTCCAATTC 
Type E -G AA-C-A---C A GG ---GAC---A 



Type A 
Type C 
Tetanus 



GTAACGCGAA TACCATTTTG CGGTCCAAGT A CA A TAC A TA CAGAATTAAA ACTTATTCAT ACTAATTGTA 
ACAGATATAC CCTTTCCTGG GA ATA ACAA T ACTCCAATTA ATACTTTTCA TTTTCATCTA CATTTTAACA 
ATA A ATGCCA TA CCTTACCT TCCA A ATTCA TATTCCTTAC TAGACA ACTT TGATACAA AC TCTA ATTCAG 



Butyricum AATTCTCAAA TGGTACCCA A ACCATACTAT TACCTAATGT TATTATAATC CCACCACACC CTCATTTATT 

Type E CA 

TypeA TTA ATGTCAT ACAACCACAT CCTACTTATA CATCAGAACA ACTTAATCTA GTAATAATAG GACCCTCACC 

TypeC CTCTTCATGT TAAAACTACA CAACCTA ACA ACTCGCTTAA A ACTCGTAGC ATAA ATCCTA GTCTTATA A T 

Tetanus TA TCTTTTA A TTTATTACAA CAAGACCCCA CTCCAGCAAC TACAAAATCA CCAATCCTCA CA A ATTTAAT 



Butyricum 
TypeE 



TCA AACTAAC AGTTCCA ATA TTTCTCTAAG A AATA ATTAT ATCCCAACCA A TCACCCTTT TCGATCA ATA 



Type A 
Type C 
Tetanus 

Butyricum 
Type E 



TCATATTATA CACTTTCAAT GTAA AACCTT TGC ACATCA A CTTTTCAATC TTACCCGA A A TCCTTATCCC 
AACTCCACCT ACACAAAACA TTATAC ATCC ACAA ACTTCT ACGTTTA A AT TA ACTA AC A A TACTTTTCCC 
A ATATTTCGA CCTGCCCCTC TTTTA A A TA A A AATGACCTT ACACGTATTG TATTC AGCCT ACATAATAAA' 

pC pi 

CCTATACTAA CATTCTCACC TCAATATTCT TTTACATTTA AACATAATAC TATCAATCAA TTTATTCAAG 



Type A 
TypcC 
Tetanus 

Butyricum 
Type E 



TCTACTCAAT ACATTACATT TACCCCACAT TTTACATTTG CTTTTGACCA CTCACTTCAA CTTCATACAA 

CCACAAGAAC CA TTTCGTGC TTTATCAATA ATTTCAATAT CACCTACATT TATCCTA A CA TATACTAATC 

A ATTACTTCC CATCTAGACA TGCTTTTCGC TCA ATAATCC AAA TCGCATT TTCCCCACAA TATCTACCTA 

ATCCTCCTCT TACATTA ATC CA TGA A TTA A TACATTCATT A CATCC A CTA TATCCCCCTA AAGCCATTAC 



TypeA ATC CTCTTTT ACGTCCACCC A A A TTTGCTA CACATCCACC AGTA ACATTA GCACATGAAC TTATACATCC 

TypeC CA ACTA ATGA TGTAGG AGAG CGTACATTTT CTAAGTCT GA ATTTTGCA TG GATCCAATAC TAATTTTAAT 

Tetanus CCTTTGATA A TCTAATACAA A ATATTA CGT CACTCACTAT TGGCAAA AGC A A A TA TTTT C A AGATCC AGC 



Butyricum 
Type E 



TACA ATGTAT ACTATAACAC AAAAACAAAA TCCCCTA ATA ACAAATATA A CACGTACAAA TATTC AAGA A TTC 



TypeA TCGACATACA TTATATGGA A TAGCAATTAA TCCAAATAGG GTTTTTAAAG TAAATACTAA TGCCTATTAT gaa 

TypeC GCATGAACTT A A TCA TGCA A TGCATAATTT ATATGGA ATA GCTATACCA A ATCATCAAAC AATTTCATCT CTA 

Tetanus ATTACTATTA ATGCACG AA C TTA TA CA TCT ACTACATCCT TTATACCCAA TCCACCTATC AACCCATCAA ATT 



Fig. 3. Nucleotide sequences of the 5'-translated and 5'-untranslated regions of butyricum, botulinum and tetanus toxin genes. The 
DN A sequence of the butyricum toxin gene EcoRl fragment is compared to the sequence of botulinum type A (Binz et at., 1990), type C 
(Kimura et at., 1990), type E (Fujii et ai t 1990) and tetanus (Eisel et at., 1986) toxin genes. The putative Shine-Dalgarno sequence is 
indicated by SD. The sequences homologous to the - 10 region of the E. coli promoter (TAT A AT) are marked with double underlines. 
A sequence with similar dyad symmetry is underlined by divergent arrows. Numbers represent free energy values in kcal mol' 1 
(1 kcal = 4-2 kJ). Dashes in the botulinum type E sequence represent bases identical to the C. butyricum sequence. The highly 
homologous sequences are underlined, and the start site of homology is indicated on the butyricum sequence by A (type A), C (type C) 
and T (tetanus). ORF is the translation initiation site of ATG. 
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Fig 4 Amino acid sequences derived from the base sequence of butyricum toxin gene EcoR\ fragment. Dashes in the botulmum type E 
sequence represent amino acid residues identical to the C. butyricum sequence. The highly conserved homologous sequences are 
underlined. Arrows indicate highly conserved amino acid residues among the five toxins, and open circles md.cate common amino acid 



residues in four out of five toxins. 



Botulinum toxin gene in C. butyricum 



gene. A positive reaction was observed only with DNA 
from the toxigenic strain, BL6340 (data not shown). The 
nucleotide sequences of the toxin genes from C. 
butyricum and C. botulinum strains were quite similar in 
both coding and noncoding regions. Therefore, it is quite 
possible that a nontoxigenic C. butyricum strain has 
become toxigenic by acquiring the toxin gene from a C. 
botulinum type E strain; the toxin gene could perhaps 
have been transmitted from C. botulinum type E to C. 
butyricum cells through some transmissible vector, e.g. a 
bacteriophage or a plasmid. 

This study was supported by grants from the Ohyama Health 
Foundation and the Mishima Memorial Foundation. 
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Summary - The flaccid paralysis in the neuromuscular disease botulism appears to depend on the 
coordinated roles of the -50 kDa light and -100 kDa heavy chain subunits of the -150 kDa neurotoxic 
protein produced by Clostridium botulinum (J. Biol. Chem. (1987) 262, 2660 and Eur. J. Biochem. 
(1988) 177, 683). We observed that the light chain after separation from its conjugate heavy chain, in 
the presence of dithiothrcitol and 2 M urea, begins to split into -28 and -18 kDa fragments. The other 
subunit-thc -100 kDa heavy chain following its isolation-and the parent -150 kDa dichain neuro- 
toxin do not break down under comparable conditions. This cleavage was examined in the neurotoxin 
serotypes A and E. The cleavage does not appear to be due to a protease. Partial amino acid sequences 
established that: i) the -28-kDa and -18-kDa fragments comprise the N- and C-terminal regions of 
the light chain, respectively; ii) the light chain of the neurotoxin serotypes A and E break down at 
precise peptide bonds; iii) the peptide bonds cleaved in serotypes A and E are five residues apart; and 
iv) the portions of the -18 kDa fragments of serotype A and E neurotoxin sequenced so far arc highly 
homologous to the corresponding region of tetanus neurotoxin produced by Clostridium tetani The 
partial N-terminal sequence of the ^28 kDa fragment matches with the N-terminal sequence of the 
intact L chain. The 47 residues of the -18-kDa fragment of type A sequenced from its N-terminal are: 
-y.E.M.S.G.L.E.V.S.F.£^.R.^^ 

D7.AS.T.L.-. These align with those of tetanus neurotoxin beginning at its residue #259 (Tyr); the 
18 underlined residues of the above 47 residues (i.e. 38%) are identical in positions between the two 
Droteins The 41 residues sequenced from the -18 kDa fragment of type E botidmum neurotoxin are: 
-K.GV.N.i:O.F.L. ~7^.G .N.^^ 

Z.-. These align with tetanus neurotoxin, beginning at its residue #264 (Tyr); the 20 underlined 
residues (i.e. 49%) are positionally identical to tetanus neurotoxin. Also the 13 overlined residues of 
type E (i.e. 32%) are positionally identical to type A botulinum neurotoxin. 

botulinum type A and E neurotoxin /light chain /telanus neurotoxin /sequence /homology 



Introduction 

Botulinum neurotoxin (NT) blocks release of the 
excitatory neurotransmitter acetylcholine, 
and thus causes flaccid paralysis in the neuro- 
muscular disease, botulism. The -15C kpa NT 
is produced by Clostridium botulinum in 7 anti- 



gcnically distinct forms, called types A-G. The 
single chain polypeptide is nicked at 1/3 the 
distance from the N-terminal by protease 
endogenous to the bacteria or by exogenous 
proteases such as trypsin. The —50 kDa light (or 
L) and -100 kDa heavy (or H) chain subunits, 
containing the original N- and C-terminal ends. 
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respectively, remain linked by noncovalcnt and 
disulfide hond(s). The two subunits have been 
separated chromatographically and analyzed for 
amino acid compositions and partialamino acid 
sequences |1-4J. These two subunits of the NT 
appear to have distinct roles in the mechanism 
of action of the -150 kDa NT. Experiments with 
neuromuscular junction preparations and 
isolated L and H subunit chains from types A 
and B NTs suggest that binding of the H chain 
with the specific sites on the nerve terminals 
permits the L chain or some combination of the 
L and H chains to enter the cell; then presuma- 
bly the L chain causes blockade of release of the 
neurotransmitter acetylcholine and thus brings 
about paralysis [5, 6). Recently types A and B 
NTs and the L chains isolated therefrom, but not 
the corresponding isolated H chains, were found 
to inhibit catecholamine release from digitonin- 
permeabilized bovine chromaffin cells [7j. 

We noticed [4] that the L chain of type A NT, 
following its separation from the H chain and 
storage for some time, yielded by polyacryl- 
amide gel electrophoresis in the presence of SDS 
(PAGE-SDS) two extra bands, -28 and 
-18 kDa, in addition to the 53 kDa L chain 
band. These or similar bands did not originate 
from the separated 97 kDa H chain or the parent 
145 kDa NT. Because the type A NT is isolated 
from C. botulinum (strain Hall) that is proteo- 
lytic [8], we extended our study of the time 
dependent appearance of —28 and —18 kDa 
bands to the type E NT which is produced by 
nonproteolytic C. botulinum (strain, E-43) [8]. 
The L chain subunit isolated from type E NT 
[4] but not the H chain, was also found to yield 
-28 kDa and -18 kDa fragments. These and 
other experimental observations argue against 
the probability that the breakdown of the L 
chain at a precise site is mediated by traces of 
protease that could be present in the purified NT 
and suggest that the breakdown is spontaneous 
(see Discussion). Further studies on the struct- 
ure-function relationship of the NT using 
isolated L chain may take into account the 
breakdown of the L chain reported here. 

Partial amino acid sequences of the —28 kDa 
and - 1 8 kDa fragments of the L chains revealed 
extensive amino acid sequence homology 
i), between the antigcnically distinct botulinum 
NT types A and E; and ii), among the two 
botulinum NT serotypes which cause flaccid 
paralysis, and tetanus NT which blocks release 
of inhibitor neurotransmitter and causes spastic 
paralysis in tetanus. 



Materials and methods 

Botulinum NT serotypes A (strain Hall) and E 
(strain E-43) were produced and purified in our 
laboratory [9, 1UJ. The H and L chain subunits of 
type A NT were separated on ion exchange columns 
(4] . Because the type E NT is isolated from the bacte- 
rial culture as a 147 kDa single chain protein [10] it 
was nicked with trypsin to generate the 147 kDa 
dichain form in order to separate its subunit chains 
[4]. Details of nicking with trypsin, inactivation of 
trypsin, chromatographic separation of the chains, 
their purity, and partial N-terminal sequences of 
the subunits of type A and E have already been 
described [4J. Soybean trypsin inhibitor, aprotinin 
and phcnylmethanc sulfo'nyl fluoride (PMSF), all 
purchased from Sigma Chcm., were used as protease 
inhibitors. 

The L chains of types A and E NTs following 
separation from their conjugate H chains on 
QAE-Sephadcx columns were recovered in 
borate -phosphate buffer pH 8.4, containing 2 M 
urea and at least 10 mM dithiothrcitol (DTT) (see [4) 
for exact compositions of buffer). The -50 kDa L 
chains were dialyzcd extensively at 8°C, against 
0.02 M Na 2 MP0 4 -NaM2P0 4 buffer, pH 6.0 con- 
taining 5 mM DTT and then precipitated with solid 
ammonium sulfate (39 g/100 ml, Ultrapure, 
Schwarz / Mann) and stored at 8°C. At suitable inter- 
vals the protein samples were recovered by ccntrifu- 
gation, dissolved in electrophoresis buffer [11] and 
analyzed by SDS-PAGE [11]. SDS-PAGE of the 
protein samples in cylindrical gels has already been 
described [11 J. The following proteins (Pharmacia; 
m.w. indicated in parentheses) were used to estimate 
the size of the fragments of the L chain; phosphory- 
lasc b (94 000), bovine scrum albumin (67 000), 
ovalbumin (43 000), carbonic anhydrase (30 000), 
soybean trypsin inhibitor (20 100), a-lactalbumin 
(14 400). Migration distance plotted against log 
molecular weight of the proteins provided the 
calibration curve [12]. 

For amino acid sequence the Coomassie blue- 
stained gels were sliced with a razor blade, and the 
stained protein was recovered from the gel by elec- 
trocution [13], us was done before [14]. A gas-phase 
sequencer (Applied Biosystcms Model 470A) was 
used; criteria of identification of phenylthiohy- 
dantoin derivatives of the amino acids have been 
described [14]. 

Relative concentrations of the L chain and its two 
fragments were estimated from densitomctric 
tracings of the Coomassie blue stained bands in the 
cylindrical gels at 560 nm using a Gilford linear 
transport scanner (model 24105) running at 2 cm/ 
min, with recorder chart speed at 5.01 mm/min. 
The areas in the traced peaks, measured with a 
compensating polar planimcter (Kcuffcl & Esscr 
Co., Morristown, NF), represented relative protein 
concentrations 1 1 1|. 
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Results 



Progress of breakdown of the 53 kDa L chain of 
type A NT into -28 kDa and -18 kDa 
fragments during 2, 7, 22 and 45 days at 8°C in 
0.02 M phosphate buffer pH 6.0, 5 mM DTTand 
60% ammonium sulfate saturation is shown in 
Figures 1 and 2 (left panels). Analysis of the 
145 kDa dichain type A NT stored over compa- 
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Fig. 1. Photographs of 6 mm cylindrical gels show type A L 
chain (left panel, gels 1-5), type A NT (middle panel, gels 
1 -3) and type E L chain (right panel, gels 1-3) and the pro- 
gress of formation of -28 and -18 kDa fragments. The left 
panel shows appearance of -28 and -18 kDa bands at -50 
and -60 mm, and the type A L chain at 35 mm. The L chain 
immediately after isolation and after 2, 7, 22, and 45 days 
arc shown in gels 1-5. respectively (densitometry scans of 
these gels arc shown in Fig. 2 left panels a-e, respectively). 
Direction of migration of bands in gels top to bottom 
(anode) corresponds to peaks migrating left to right in 
charts. L chain samples in gels #1-4 were from NT batch 
#34; gel #5 had L chain isolated from NT batch #31 . Loads 
of protein on gels #1-4 were -0.02 mg in 400 /nl, gel #5 
received -0.06 mgin 150/xi. Because the gels were prepared 
and run on different days, the locations of the bands do not 
align perfectly between the gels. Also, the intemity of the 
bands varies between these gels because they, were stained 
and destaincd on different days and thus nonutentically. The 
middle panel shows that type A NT stored for 14 (gel 1), 21 
(gel 2) and 42 days (gel 3) dissociated into H and L chains 
as bands at 18-22 mm and 35-37 mm respectively. Gels I 
and 2 were loaded with -0.03 mg protein in 1(H) gel 3 
w;i> loaded willi Iwicc llic miutunl. Trace impuriticr. ::rc v:r.i 
ble at high protein load, but the -28 and - 18 kDa fragments 
are not seen. The right panel shows type EL chain immedia- 
tely after isolation (gel 1) and after 27 (gel 2) and 74 days 
storage (gel 3). The -28 kDa and -18 kDa fragments arc 
at 47-50 mm and 55 - 60 mm, respectively. Gels 1, 2 and 3 
were loaded with -0.03. -0.04 and -0.03 mg protein in 
4(H) p\. NT batches JG-2, 49 and JG-1 were used for gels 1, 
2 and 3, respectively. Protein samples were boiled 5 min in 
the presence of SDS and mercaptoetliatiol |15J. The 7.5% 
gels ( 1 5 mg acrylamidc plus 390 mg A\ /V'-mcthylenebisacry I- 
amidc in 200 ml buffer) were run 5.5 h at 9 mA/ gel. 



rable period did not yield these two fragments 
(Figs. 1 and 2, middle panels). The isolated 
97 kDa H chain also did not yield these 
fragments (not shown). 

Three separately purified preparations of the 
type A NT (batch #30, 31 and 34) were used as 
independent source materials to isolate the L 
chain. In all cases the pattern of the L chain 
breakdown was similar. The — 18 kDa 
fragments generated in these separate prepara- 
tions in 16 days (batch #34), 30 days (batch #30) 
and 45 days (batch #31) and the -28 kDa 
fragment from one preparation (batch #30, 30 
days) were examined for partial amino acid 
sequences. The freshly prepared L chain, not 
stored in pH 6.0, was also partially sequenced. 
The replicate samples of — 18 kDa fragment had 
identical sequences (Fig. 3). 

Fragmentation of the L chain never reached 
completion in spite of manipulation of the 
storage conditions in terms of time (up to 45 days 
for type A and 74 days for type E), temperature 
(room temperature vs. 8°), pH (8.4 vs. 6.0), 
presence or absence of ammonium sulfate at 
60% saturation or 2 M urea. Storage of the L 
chain in the presence of protease inhibitors such 
as soybean trypsin inhibitor (inhibitor:!, chain; 
5: 100 (w / w) , PMSF (2 mM) and aprotinin (each 
0.5% w/v) did not prevent or significantly 
retard progress of fragmentation. The relative 
proportions of the 53 kDa L chain to the 
fragments (i.e., -28 kDa plus -18 kDa 
fragments) estimated from 3 separate experi- 
ments were as follows: 1:1.93 (batch #31, 45 
days old); 1:1.73 (batch #31, 31 days old), and 
1:1.27 (batch 34, 16 days old). The ratios of 28 
kDa and 18 kDa fragments in these samples 
were 1:1.61, 1:1.03 and 1:1.63, respectively. 

The 50 kDa L chain subunit of type E NT, 
following its separation from the H chain was 
also found to break down into -28 kDa and -18 
kDa fragments (Figs. 1 and 2; right panels). Two 
separately purified batches of type E (batch 
#JG-1 and JG-4) were used as independent 
source material to isolate the L chain and 
analyze its breakdown. The preparations were 
stored at 8°C in 2 different ways: i) in 0.02 M 
phosphate buffer, pH 6.0, in the presence of 
5 mM DTT and 60% saturation of ammonium 
sulfate (39 g/100 ml Ultrapure, Schwartz- 
Mann) for 74 days; and ii) in borate -phosphate 
buffer pH 8.4 (see [4] for exact composition) in 
the presence of 5 mM DTT and 60% saturation 
of ammonium sulfate for 48 days (urea was 
removed by dialysis against borate -phosphate 
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isitometric tracings of the cylindrical gels (in Fig. 1 ) loaded with type A L chain (left panels, a-e), type A NT (middle 
r) and type E L chain (right panels, a-c. Left panels a-e correspond to gels 1-5 in Fig. 1 (left panel). Middle panels 
:ingsof the gels 1-3, respectively, shown in Fig. 1 (middle panel). Right panels a-c represent gels 1-3, respectively, 
eht panel). 



Fig. 2. Dcnsitomctric 
panels, a 

a-c are tracings of the gels 
in Fig. 1 (right panel). 



buffer containing DTT). The -18 kDa 
fragments from these two preparations and 
—28 kDa fragment from one preparation were 
partially sequenced (Fig. 3). The isolated H 
chain, its progenitors the dichain NT (nicked 
with trypsin) and the single chain type E NT 
(before nicking with trypsin) following long 
storage under similar conditions did not yield the 
-18 and -28 kDa fragments (results of SDS- 
PAGE not shown). As mentioned in the case of 
type A NT, manipulation of storage conditions 
did not significantly arrest or promote the 
breakdown of the L chain to completion. At the 
end of 74 days of storage the relative proportion 
of the L chain to the fragments (28 kDa plus 18 
kDa) from the batch #JG-1 was 1:0.9; and the 
ratio of 28 kDa:18 kDa fragments was 1:1.45. 



Discussion 

Examination of the L chain isolated from diffe- 
rent preparations of the— 150 kDa NT that were 
purified from separate batches of bacterial 
culture showed the following consistent results: 
i) the -50 kDa L chain yields -28 kDa and 
-18 kDa fragments; ii) generation of these two 
fragments is time-dependent but fragmentation 
does not go to completion; iii) fragmentation 
results from peptide cleavage (SDS-PAGE and 
amino acid sequence analyses) at a precise site 
(the newly released N-terminal on the —18 kDa 
fragment is identical in replicate samples); 
iv) the sites of peptide cleavage On the L chains 
from types A and E arc only 5 amino acid 
residues apart. Compared to replicate analyses 
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Fig. 3. Partial amino acid sequences of the L chains of botulinum types A and E and their -18 kDa fragment compared with 
tetanus NT. The residues of the light chain ot type A and E botulinum NT underlined match with the underlined residues in 
tetanus NT. Tetanus sequence data from [29]. 



of the -28 and -18 kDa fragments from the 
same preparation of the L chain, our analyses of 
the L chain isolated from separate batches of the 
NT allowed a more rigorous examination of the 
two fragments. Lastly, any conclusion or 
hypothesis that can be entertained regarding the 
cause of peptide cleavage is applicable to type A 
as well as type E. 

Why was the fragmentation of the L chain not 
observed earlier [15, 16, 17]? The L chain sub- 
units of type A and E NTs were first identified 
in SDS-PAGE [15]. Each dichain NT not 
reduced with mcrcaptoethanol migrated as a 
single band (-150 kDa); following reduction of 
the disulfide bonds, this band was replaced by 
two bands (-100 and -50 kDa). The -100 and 
—50 kDa chains separated electrophoretically in 
the presence of SDS remained fixed in the gel. 
Next, for amino acid sequence determinations, 
the two subunit chains, following recovery from 
QAE-Sephadex column [4] and precipitation 
with ammonium sulfate, were processed quickly 
for Edman degradation [16, 17]. For amino acid 
analysis [4] the L chain preparations soon after 
isolation were checked for purity by SDS- 
PAGE (see Fig. 1 and 3 in [4], dialyzcd extensi- 
vely against water, lyophilized and then acid 
hydrolyzed. Even if the L chain had fragmented 
the smaller fragment must have remained inside 
the dialysis bag, because the sum of the amino 
acid residues of the lyophilized L and H chains 
added up to the amino acid content of the 150 
kDa dichain NT [4]. 



We do not have a definitive explanation as to 
why the fragmentation occurs. Probable causes 
of the fragmentation, proteolytic or nonproteo- 
lytic, are considered here based on experiments, 
observations and analogies. Although the 
presence of traces of a protease in the NT prepa- 
ration is a possibility which is extremely difficult 
to disprove, the following considerations argue 
against a protease catalyzed fragmentation: 
i) during its isolation, the NT was treated with 
the protease inhibitors aprotinin and PMSF 
twice, i.e. at two different steps of the purifica- 
tion protocol [9]; ii) if a proteolytic activity sur- 
vived the inhibitors and was copurified with the 
NT, the protease did not generate the -28 and 
-18 kDa fragments from the dichain NT 
(composed of L and H chains) or other 
fragments from the isolated H chain; iii) only the 
L chain was fragmented, and then only after 
separation from the H chain. This would be pos- 
sible if the L chain has a unique amino acid 
sequence, not present in the H chain, and beco- 
mes exposed to the hypothetical protease only 
after separation from the H chain; iv) the strains 
of C. botulinum that produce types A and E NT 
are proteolytic and nonproteolytic, respectively 
[8, 18]. This difference in the physiology 
between the two strains is also evident from the 
facts that: type A NT isolated from 96-hr old 
cultures is a dichain protein (the single chain 
protein is nicked by an endogenous protease at 
1 / 3 the distance from the N-terminal, see [4, 16, 
17]); and the type E NT is recovered as a single 
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chain molecule. Therefore the type E NT isolat- 
ed from non-protcolytic culture would be less 
likely to have the trace protease considered in 
the case of type A NT. Trypsin, following 
nicking of the single chain type E to the dichain 
form, was inactivated with soybean inhibitor 
and /or PMSF or removed by using soybean 
trypsin -inhibitor- agarose (Sigma). The dichain 
type E was the source of the L chain. As men- 
tioned above, a trace of trypsin could survive the 
inhibitors but then its proteolytic activity must 
be confined to a unique peptide bond present 
only when the L chain is isolated, rather than 
combined with the H chain; v) fragmentation of 
the L chain from type A or E has never reached 
completion or near completion (sec Fig. 2 and 
ratio of peaks) in spite of manipulation of 
solvent, temperature, pH and time. 

An explanation alternative to the protease 
catalyzed fragmentation is that an intrinsic 
property of the L chain polypeptide causes its 
breakdown. Studies of various proteins indicate 
that a simple code in a polypeptide, i.e. certain 
amino acids and specific sequences, can be the 
determinant of the polypeptide's degradation. 
These studies led to the formulation of the N- 
terminal rule of Varshavasky [19], the PEST 
hypothesis [20] and the identification of the 
-KFERQ-sequence [21] and of both aspartyl 
and asparaginyl peptides as possible hot spots 
for nonenzymatic breakdown of proteins {22j. 

The spontaneous degradation at physiological 
pH and over a range of temperatures was noted 
in: i) the hexapeptide -Val-Tyr-Pro-Asn- 
Leu-Ala- (cleavage between Asn and Leu) 
modeled after sequences of residues #22-27 
(- Val-Tyr-Pro-Asn-GIy- Ala-) of adreno- 
corticotropic hormone [22]; and ii) the or A sub- 
unit of bovine a-crystallin (cleavage at residue 
Asn- 101; [23]). The cleavage in both cases 
occurs following the same pathway, viz. cycliza- 
tion of Asn to succinimide ([22-24]; and Fig. 1, 
in [23]). These examples and discussions 
[22-25] allow us to conjecture that the cause of 
slow and incomplete breakdown of the L chain 
is perhaps formation of a succinimide ring from 
certain Asn or Asp residues, which are plentiful 
in the NT [4]. The validity of this proposal could 
be tested using sophisticated and demanding 
protein chemistry techniques; for example, the 
truncated L chain, i.e. the -28 kDa fragment, 
would be expected to have C-terminal aspara- 
gine and also C-terminal aspartic acid amide, 
because hydrolysis of the labile succinimide takes 
two paths (sec structures VI and VII in Fig. 1 of 



|23]). Detection of a C-terminal aspartic acid 
amide, as Voortcr et ai [23]) have suggested, 
would also eliminate the possibility that a 
protease cleaves the L chain into —28 and 
— 18 kDa fragments. 

Why docs the L chain break down only after 
its separation from the H chain? A probable 
answer is found in Clark's explanation [24]: "asp 
and asn residues generally exist in native 
proteins in conformations where the peptide 
bond nitrogen atom cannot approach the side- 
chain carbonyl to form a succinimide ring -suc- 
cinimides may largely form in denatured 
or partially denaturatcd proteins in which 
rotation around the main chain (psi angle) and 
side chain (chi angle) dihedral angle allow intra- 
molecular imide formation at sites which are 
conformational^ unfavorable in the native 
structure". Clark [24] also pointed out that 
"main chain conformations that are conducive to 
succinimide attack (psi = -120°) are relatively 
rare in proteins" but in type IT beta hairpin turns 
this conformation is found for the amino acid 
residue in the second position. Relevant to this, 
we note that: i) the L chain, following its separa- 
tion from the H chain in the presence of 2 M urea 
and DTT, may not be in a conformation identi- 
cal to when it is linked to the H chain via disul- 
fide and noncovalcnt bonds; and ii) secondary 
structure analysis shows a large content of /3-turn 
in the L chain; 18.75% compared to 13.00% in 
the H chain and 15.25% in the NT [26]. 

The structural similarity between botulinum 
and tetanus NTs, first pointed out in 1977 [27] 
has became more evident with the availability of 
their amino acid compositions and partial amino 
acid sequences ([4] and refs. therein). The L 
chains of the botulinum and tetanus NTs inhibit 
exocytosis of catecholamine from chromaffin 
cells [7, 28]. The complete amino acid sequence 
of tetanus NT, deduced from the nucleotide 
sequence, shows that the mature protein 
(-150,700 Da) and its light (-52,288 Da) and 
heavy (98,300 Da) chain regions are composed 
of 1314, 456 and 858 amino acid residues, respec- 
tively [29]. Characterizations of the —28 and 
—18 kDa fragments from type A and E botuli- 
num NTs based on partial amino acid sequences 
are discussed in the context of tetanus NT. The 
first 20 residues of the —28 kDa fragment from 
botulinum type A NT matched exactly with the 
N-terminal sequence of its L chain, proof that 
this fragment contains the N-terminal region of 
the L chain. The sequence of residues 1-73 of 
the intact L chain includes residues 1-17 identi- 
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fied earlier [4, 16]; residues 1-46 and residues 
1 -73 has been described in preliminary reports 
[30, 31]. The residues underlined in Figure 3 are 
positionally identical to those of the L chain of 
tetanus NT. The partial sequence of the 
-28 kDa fragment from type E L chain also 
matched completely with that of the first 20 
residues of 50 kDa E L chain [17]. 

The -18 kDa fragments from types A and E 
align with the L chain of tetanus NT beginning 
at its residue #259 (Tyr) and #264 (Tyr), 
respectively (Pro, the N-tcrminus of tetanus NT 
is counted as #1 residue). This region of the 
tetanus NT contains the C-terminal half of its L 
chain (half of 456 residues). The two -18 KDa 
fragments can therefore be considered to corres- 
pond to the C-terminal half of the L chains of 
botulinum NT types A and E. Homology based 
on positional identity alone between types A and 
E is 3 1 % ; between botulinum type A and tetanus 
38.2%; and between botulinum type E and teta- 
nus 48.8%. Homology increases if single nucleo- 
tide base substitution is considered (not shown 
here). 

Fragmentation of the isolated L chain of botu- 
linum NT is not restricted to types A and E. We 
have observed the same in botulinum NT type B, 
which is currently under investigation. 

The mode of action of botulinum NT is like 
those of other dichain toxic proteins (e.g. diphte- 
ria, ricin) in that its L chain after entering the 
cell appears to cause the metabolic lesion [7]. 
Whether the L chain separates from the H chain 
in vivo is not yet known, but the slow in vitro 
fragmentation reported here warrants conside- 
ration as in the studies where the isolated L chain 
was used [5-7]. 
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B tulism is widely known to result from ingestion of food containing botulinum neurotoxin produced in situ 
by certain strains of Clostridium botulinum. Infant botulism caused by C. botulinum, unlike the food-borne 
intoxication, is the toxkoinfectious form of botulism (S. S. Ai nun, p. 331-345, in G. E. Lewis, ed., Biomedical 
Aspects of Botulism, 1981). The strain of Clostridium baratii implicated in infant botulism produced a 
neurotoxin that was neutralized with antiserum for botulinum neurotoxin serotype F (J. D. Hall, L. M. 
McCroskey, B. J. Pincomb, and C. L. Ha the way, J. Clin. Microbiol. 21:654-655, 1985). We developed a 
procedure to culture the toxigenic C. baratii (strain 6341) in dialysis bags and a simple purification scheme 
(precipitation of 900-ml culture supernatant with ammonium sulfate and two anion-exchange chromatographic 
steps at pH 5.5 and 8.0) that yielded up to 150 u.g of purified neurotoxin. It is an - 140-kDa single-chain protein 
and has the following sequence of amino acid residues at the N terminus: Pro- Val-Asn-Ile-Asn-Asn-Phe-Asn- 
Tyr-Asn-Asp-Pro-Ile-Asn-Asn-Thr-Thr-IIe-Leu. Comparison of this amino acid sequence with those of the 
botulinum neurotoxin serotypes A, B, and E showed 40 to 50% identical residues in comparable positions. The 
specific toxicity of the neurotoxin, ~2 x 10 6 50% lethal doses for mice per mg of protein injected, was not 
enhanced significantly by mild trypsinization, although the protease cleaved the neurotoxin within a disulfide 
loop that generated at least two primary fragments, -47 and -86 kDa, that remained linked by an interchain 
disulfide. These two fragments resembled the light and heavy chains of the well-characterized neurotoxin 
serotypes A, B, C, D, E, and F produced by C. botulinum. 



Clostridium botulinum and C. tetani have been well rec- 
ognized, until recently, as the only two bacterial species that 
produce the two extremely poisonous proteins— botulinum 
and tetanus neurotoxins (18). The former causes flaccid 
paralysis in botulism, and the latter causes spastic paralysis 
in tetanus (16). Botulinum neurotoxin is found in nature as 
seven antigenically distinguishable proteins (serotypes A, B, 
C lt D, E, F, and G); tetanus neurotoxin occurs as a single 
serotype. The neurotoxins of these two Clostridium species 
have strikingly similar primary structures, structure-function 
relationships, and modes of action; however, their primary 
sites of action are different. Botulinum neurotoxin acts at 
neuromuscular junctions, and tetanus neurotoxin acts at 
inhibitory synapses in the central nervous system (for re- 
views of the pharmacology of these two proteins, see refer- 
ences 16 and 27). 

Certain strains of C. baratii and C. butyricum responsible 
for human infant botulism (2) cases were isolated around 
1985. The neurotoxins produced in cultures of these clostrid- 
ial species were neutralized by anti-botulinum neurotoxin 
sera (17, 24); the genetic identities of the C. baratii and C. 
butyricum strains were confirmed (32). These findings have 
profound implications, one of which is that non-C. botuli- 
num species found in the normal flora of infant intestines 
may acquire and express the neurotoxin gene. This elicited 
for us an immediate question: how do the neurotoxins 
produced by C. baratii and C. butyricum compare structur- 
ally with those produced by C. botulinum'} The neurotoxin 
elaborated by C. butyricum was purified (15). and its partial 
amino acid sequence (13) and molecular topography (28) 
were determined; these and a few other features were 
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compared with those of botulinum neurotoxin type E (13, 15, 
21. 28). We now report partial characterization of the neu- 
rotoxin produced by C. baratii 6341 (to be referred to 
henceforth as baratii neurotoxin) in t rms of (i) molecular 
size, (ii) partial amino acid sequence, (iii) toxicity (50% lethal 
doses [LD 50 ] for mice per mg of protein injected), and (iv) 
enhancement of toxicity (activation) after mild trypsiniza- 
tion. We also compare these features with those of the 
botulinum neurotoxins. 



MATERIALS AND METHODS 

Culture media. Four different media of the following 
compositions were used to maintain the stock culture and to 
produce the neurotoxin: medium A, 12.5% solid meat media 
(Difco), 0.3% glucose, 0.2% soluble starch (Difco), and 
0.05% sodium-thioglycolate (Sigma): medium B, 2% Trypti- 
case-peptone (BBL, Becton Dickinson Microbiology Sys- 
tems), 1% glucose, 0.5% yeast extract (BBL), 0.2% soluble 
starch, and 0.05% sodium-thioglycolate; medium C, 1% 
glucose and 0.05% sodium-thioglycolate; medium D, 4% 
Trypticase-peptone, 1% yeast extract, and 0.05% sodium- 
thioglycolate. In each case all the ingredients were dissolved 
(wt/vol) in 0.1 M sodium potassium phosphate buffer, pH 
7.0. 

Bacterial culture. One milliliter of C. baratii (strain 6341), 
grown in cooked-meat medium and kindly provided by 
Charles Hatheway (Centers for Disease Control, Atlanta, 
Ga.h was inoculated into 250 ml of medium A in a 500-ml 
flask and incubated for 5 days at 37°C. The bacterial cells in 
the culture fluid were packed into a small volume by centrif- 
ugation (10.000 x 20 min). suspended in sterile 20% skim 
milk (Difco; dissolved in water), and then distributed in 2-ml 
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TUBE NUMBER 

FIG. 1. Chromatography of culture supernatant concentrate on a 
DEAE-Sephadex A-50 column (2.5 by 60 cm) equilibrated and 
eluted with 50 mM sodium citrate buffer. pH 5.5. Chromatography 
was done at room temperature. The flow rate was 25 ml/h. and the 
fraction volume was 2.4 ml. The fractions containing the first peak 
were pooled and are indicated with a bar. 



vials as stock culture. These were stored at -70°C or 
lyophilized and stored at 4°C. 

The neurotoxin was produced in dialysis bags (31) with 32- 
or 100-mm-flat-width and 100-cm-long dialysis tubings 
(Spectrapor; D1615-2, 12- to 14-kDa cutoff, or 132670, 6- to 
8-kDa cutoff limit). Approximately 30 ml of medium B in a 
50-ml screw-cap glass tube was inoculated with 0.2 ml of 
stock culture (thawed or rehydrated with medium B). After 
-12 h of incubation at 37°C, 1 ml of this culture was 
transferred to 30 ml of medium B in a glass tube and further 
incubated overnight at 37°C. One milliliter of this culture was 
inoculated into 150 ml of medium C contained in a dialysis 
bag which was immersed in a 1,000-ml glass cylinder con- 
taining 500 ml of medium D. After 5 days of incubation at 
37°C, the contents of dialysis tubes from six parallel cylin- 
ders were pooled and centrifuged (10,000 x 60 min). The 
supernatant was 60% saturated with ammonium sulfate (390 
mg/ml) and kept at l C overnight. Our one attempt to harvest 
more neurotoxin from a larger volume of culture fluid by 
using the larger-diameter dialysis bag (100 versus 32 mm) of 
identical length (100 cm) gave lower yield. Porosity of the 
bags (i.e., molecular size cutoff and volumes inside and 
outside of the bags were different; whether these were 
factors in the neurotoxin yield remains to be explored. 

Neurotoxin purification. The material precipitated with 
ammonium sulfate was recovered by centrifugation (10,000 
x g, 45 min); dissolved in 15 ml of 50 mM citric acid- 
disodium citrate buffer, pH 5.5; and then dialyzed against 
this buffer for several hours at 4°C. The dialyzed solution 
(-25 ml) was centrifuged (15,000 x g, 60 min at 25°C); the 
supernatant was loaded into a DEAE-Sephadex A-50 col- 
umn (2.5 by 60 cm), equilibrated, and eluted with the 50 mM 
citrate buffer, pH 5.5. The UV absorbing materia! (indicated 
with a bar in Fig. 1) eluting as a wide peak ahead of a large 
peak was pooled; dialyzed against 30 mM sodium phosphate 
buffer, pH 7.0, for 5 h at 4 e C; and finally precipitated with 
ammonium sulfate (390 mg/ml). After 16 h at 4°C. the 
precipitate was recovered by centrifugation (10.000 x g, 30 
min, 4°C); dissolved in 5 ml of 20 mM sodium phosphate 
buffer. pH 8.0; and dialyzed against this buffer for 16 h at 
4°C. Turbidity that appeared during dialysis was removed by 
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FIG. 2. Chromatography on a DEAE-Sephadex A-50 column 
(1.8 by 8 cm) equilibrated and eluted at 4°C with 20 mM sodium 
phosphate buffer (pH 8.0). Flow late was 12 ml/h, and the fractions 
were of 2 ml. The salt gradient, indicated with a dashed line, was 
made with 50 ml of 20 mM sodium phosphate buffer, pH 8.0. and 50 
ml of the same buffer but containing 0.3 M NaCl. The neurotoxin 
eluted isocratically and was recovered in the first peak (neurotoxin 
pool is indicated with a bar). 



centrifugation. The clear supernatant was loaded onto a 
DEAE-Sephadex A-50 column (0.9 by 6 cm) equilibrated at 
4°C with the pH 3.0 buffer. The UV absorbing material that 
did not bind to the column and emerged as the first peak (Fig. 
2) was pooled, precipitated with ammonium sulfate (390 
mg/ml), and stored at 4°C. 

SDS-PAGE. Protein samples were analyzed by sodium 
dodecyl sulfate-polyacrylamide gel H^trophoresis (SDS- 
PAGE; 10% acrylamide, 0.89% cross-linking) in a discontin- 
uous buffer (22) and with the stain Coomassie blue R-250 
(Sigma). Their molecular masses were estimated from the 
relative electrophoretic migration of marker prcteins (phos- 
phorylase b, bovine serum albumin, ovalbumin, and car- 
bonic anhydrase, which were 94, 67, 43. and 30 kDa, 
respectively [Pharmacia!) as well as the well-characterized 
botulinum neurotoxin serotypes A, B, and E (5). 

Eiectroblotting and amino acid sequence deternunsi* ^. 
The baratii neurotoxin after electrophoresis on 6% linear 
acrylamide gel with SDS <1.6-mm-thick, 10- by 10-cm min- 
islab; 40 mA for 90 min) by using the discontinuous buffer 
system (22) was electrotransferred (0.5 A, 60 min) to a 
polyvinylidene difluoride (Millipore, Bedford, Mass.) mem- 
brane by the method of Matsudaira (23) with the minor 
modifications reported before (12). The protein blotted onto 
the polyvinylidene difluoride membrane was stained with 
Coomassie blue, air dried, excised out with a razor blade, 
and stored at -20 C C. For amino acid sequencing. Applied 
Biosystems Instrument pulse-liquid phase (models 475A and 
477A) protein sequencers were used. Phenyhfhiohydantoin 
amino acids were identified with an on-line phenylim'ohydan- 
toin amino acid analyzer md a data analyzer system (Ap- 
plied Biosystems model 120- A and 900- A, respectively). 

Hemagglutination assay. Human and rabbit erythrocytes, 
recovered with citrate as an anticoagulant, were washed 
with 75 mM sodium phosphate buffer. pH 7.2. containing 75 
mM NaCl. Microliter plate (Dispo V plate: Scientific Prod- 
ucts) wells were loaded with 50 u.1 of Q.5?c blood cell 
suspension and 50 u.1 of test proteins serially diluted in the 
pH 7.2 buffer. The hemagglutinin protein chromatographi- 
cally separated from botulinum type A neurotoxin (30) was 
used as positive control. The platen were read for hemagglu- 
tination after 8 h at 6'C. 
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Toxicity and neutralization with antisera. T xicity was 
determined by intraven usinjecti n f 0.1 ml of sr.mple into 
the tail, vein f white mice (-22 g); time (in minutes) until 
death of mice after injection is a functi n of the toxic 
potency (4). Ability of the ..misera prepared against botuli- 
num neurotoxin serotype A, B, E, or F to neutralize baratii 
neurotoxin was examined as follows: 50 u,l of the neurotoxin 
(containing 25 LD^) was incubated with 50 u.1 of anti-type A, 
B, E, or F serum (enough to neutralize 1,000 LD 5? of the 
homologous neurotoxin) for 1 h at 37°C and then injected 
intravenc . ;ly or intraperitoneally in mice. Anti-type A, B, 
and £ neurotoxin sera were produced in rabbits as described 
before (8); anti-type F serum, produced in burros, was a gift 
from C. Hatheway. 

Effects of trypsinization on covalent structure and toxicity. 
Baratii neurotoxin (-0.1 mg/ml) was incubated with trypsin 
(Sigma; type XIH, lot 67F-8045; ratios fwt/wt]. 50:1, 25:1, 
10:1, and 2:1) at pH 6.0 or 7.0 (0.1 M sodium-pnosphate) or 
7.5 or 8.0 (0.1 M Tris-hydrochloride) for 5 to 60 min at 30 to 
35°C. Tryptic digestions were stopped with soybean trypsin 
inhibitor (Sigma; type I-S, lot 85C-8057); enzyme-to-inhibi- 
tor ratio was 1:3 (wt'wt). The digests and the control 
(neurotoxin incubated without trypsin) were diluted with 50 
mM sodium phosphate buffer. pH 6.0, containing 0.2% 
gelatin (Difco) and then tested for toxicity. Trypsinized 
neurotoxins with and without dithiothreitol <DTT) reduction 
were analyzed by SDS-PAGE. 

Protein concentration was determined with bicinchoninic 
acid (29) (by using the protein determination kit from Pierce, 
Rockford, III.). Bovine serum albumin was used as the 
reference protein. 



RESULTS AND DISCUSSION 

C. baratii 6341 , cultured conventionally (i.e., without 
dialysis bag), produced amounts of the neurotoxin (<500 
LD 50 /ml) that were too low for biochemical studies. Dialysis 
bag culturing (31) improved the yield to >5,000 LD 5(1 /ml, 
which was lower than the range from lO* to 10 8 LD 50 /ml 
found in cultures of C. Botulinum types A, E, and F (9, 14, 
33). The baratii neurotoxin was purified (2 x 10 6 LD 50 /mg) 
more than 1,000 times from the culture supernatant (1.5 x 
10 3 LDjo/mg). The average yield of -0.15 mg of purified 
baratii neurotoxin recovered from ~900-ml culture com- 
pared with -1.0, ~1.5, and 0.63 mg of pure botulinum 
neurotoxin types A, E, and F per liter, respectively, from 
conventional culture (9, 14. 3^) indicates low toxigenicity of 
C. baratii 6341 rather than poor recovery of the protein 
through purification steps. 

The SDS-PAGE analysis shows progress through the 
purification steps (Fig. 3). The pool from the pH 5.5 column 
(Fig. 1) had 140% 120-, and 60-kDa protein bands along with 
two or three smaller-size proteins (Fig. 3, lane 3). Chroma- 
tography at pH 8.0 (Fig. 2) removed the 120-kDa and most of 
the other proteins from the 140-kDa neurotoxin. The baratii 
neurotoxin did not bind to a DEAE-Sephadex A-*>0 column 
equilibrated with 20 mM sodium phosphate buf.cr, pH 8.0. 
We tested QAE-Sephadex A-50 and Mono-Q resin (Fast 
Protein Liquid Chromatography; Pharmacia; equilibrated 
with the same buffer. Both were unsatisfactory. The neuro- 
toxin preparation partially bound to the QAE gel and eluted 
as a broad peak under a gradient of increasing NaCI concen- 
tration (not shown). The Mono-Q column bound the neuro- 
toxin: an increasing salt gradient eluted a single peak con- 
taining a significant amount of impurities (not shown). 
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FIG. 3. PAGE pattern of toxin samples. Lanes: 1 and 12, molec- 
ular mass markers; 2. crude baratii neurotoxin before chromatogra- 
phy at pH 5.5 column; 3. neurotoxin purified through DEAE- 
Sephadex at pH 5.5: 4. neurotoxin from the pH 8 chromatography: 
5, 6. and 7. botulinum type A. B. and E neurotoxins, respectively 
(the two extra bands below type A neurotoxin in lane 5 are 
impurities): 8. 10. and 11. baratii type B and E neurotoxins (same as 
in lanes 4. 6 and 7. respectively) after trypsinization and reduction 
with DTT; 9. botulinum type A neurotoxin reduced with DTT. 



DEAE-Sephadex was thereto e chosen for subsequent 
work. 

The neurotoxin obtained from the DEAE-Sephadex col- 
umn (Fig. 3, lane 4) did not change its electrophoretic 
migration after reduction with DTT. Thus, the 140-kDa 
neurotoxin appears to be a single-chain protein and not 
composed of two polypeptides linked by a disulfide bond(s). 
Determination of amino acid sequence of the neurotoxin 
prepared from two independent batches yielded Pro as the 
only N terminus (Table 1). The first preparation was ana- 
lyzed for the first 32 residues, and the second batch was 
analyzed for the first 19 residues: identical results were 
obtained. 

The 120-kDa protein (the band migrating immediately 
lhead of the 140-kDa neurotoxin in lanes 2 and 3 of Fig. 3) 
found associated with the neurotoxin after chromatography 
at i>H 5.5 was removed after chromatography at pH 8,0 (Fig. 
3, lane 4). This protein appears analogous to the nonneuro- 
toxit proteins possessing high, low, or no hemagglutinating 
activity and found tightly associated with all botulinum 
neurotoxin serotypes (25. 30). For example, the nonneuro- 
toxic proteins associated with type A and F neurotoxins 
have nigh and low hemagglutinating activity,, respectively; 
the 134-kDa protein associated with type E has no hemag- 
glutinating activity (30>. These nonneurotoxic proteins have 
other common characteristic features. They remain associ- 
ated with the neurotoxins at acidic pH but can be separated 
at alkaline pH and high ionic strength, and also the nonneu- 
rotoxic proteins elule from un an ion -exchange column a: an 
ionic strength thai is higher than thai which eiuies the 
neurotoxin. Baratii neurotoxin was freed of the 120-kDa 
protein with 20 mM phosphate buffer, pH 8.0. The 120-kDa 
protein was eluted with an increasing salt gradient (Fig. 2). 
No hemagglutinating activity was found in the bacterial, 
culture, concentrated protein samples, or chromatographic 
fractions. Absence of hemagglutinating activity resembles 
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TABLE 1. N -terminal amino acid residues of various Ctouridium neurotoxins 



Neurotoxin Reference 



Amino acid rcudut- 



30 



Homology 



Baratii" 

Botulinum 
Type A 
TypeB 
Type E 

Butyricum 

Tetanus 



P V N l N N F N Y N D P I N N T T 1 L Y M K M P Y 



Y Y D S N K Y 



3 

5 

5 

13 

U 



F.VNKQ 



V T 

- K 

- T 
I T 



N N 

N S 

N S 

N N 



N Y 
R Y 



N D 
N D 
S D 



N N 



D I 
N I 
T I 
T I 
T I 



Y I 
M M 

Y I 

Y I 
M M 



34 
53 
SO 
53 
50 



" Th « amino «W «quenator log is as folkmv the theoretical initial yields (in picomolesi and combined amino acid repetitive yield (in percent! of the first batch 
(32 residues identified; LPS 1351) were and 92*. respectively, and for the >ecund baich .19 roiducs identified: MLK 5-7-90) were lOand 91*. resp«ti«" 



the properties of products of type E culture. The relative 
proportions of baratii neurotoxin and the 120-kDa protein 
did not appear constant in the different preparations: the 
neurotoxin was always in a larger amount. 

Specific toxicity of the purified baratii neurotoxin was -2 
y 10* LD 50 /mg. Anti-botulinum type A or B serum did not 
neutralize toxicity. Complete neutralization of toxicity was 
attained with anti-type F serum, as expected (17). We also 
found that baratii neurotoxin is partially neutralized (delayed 
death in mouse lethality assay) with anti-type E serum 
(Hatheway also noted this (18aJ). 

Toxicity of the baratii neurotoxin was not significantly 
enhanced by trypsin digestion. Limited digestion of the 
neurotoxin with trypsin (20:1 [wt/wtj, 30 min at 35 6 C) 
followed by reduction of disulfide bonds with DTT generated 
two major fragments (86 and 47 kDa) and at least three other 
minor fragments K140. -60, and -43 kDa) as shown in Fig. 
3 (lanes 4 and 8). Higher trypsin concentration or longer 
digestion produced further cleavage of the 140-kDa neuro- 
toxin as well as the 86- and 47-kDa fragments. 

Generation of 86- and 47-kDa fragments from the 140-kDa 
protein by trypsinization anci separation of the two frag- 
ments after reduction with DTT suggest that trypsin cleaved 
the neurotoxin at about one-third the distance from its N or 
C terminus and within a disulfide loop. We infer that 86- and 
47-kDa fragments represent the heavy and light chains, 
respectively, of the neurotoxin. This inferred structure is 
consistent with the structures of different serotypes of bot- 
ulinum neurotoxin as well as tetanus neurotoxin. Each of 
these neurotoxins is synthesized as an -150-kDa single- 
chain protein. Proteolytic cleavage (nicking) converts these 
proteins to - 150-kDa dichain proteins, each composed of an 
M00-kDa heavy chain and an -50-kDa light chain that 
remain linked by noncovalent bonds and a disulfide bond, 
and the light chain retains the N-terminal segment of the 
parent single chain (5). Proteases endogenous to the bacteria 
or exogenous, such as trypsin and endoproteinase Lys-C, 
nick the single-chain neurotoxin to the dichain form (20). 
Botulinum neurotoxin serotypes A. B. and E isolated from 
96-h-old bacterial cultures are. respectively, nicked 
(dichain). partially nicked (mixture of 80 to 909? single and 10 
to 20% dichain). and unnicked (single-chain) proteins (5). In 
Fig. 3 the 140-kDa baratii neurotoxin (lane 4) is compared 
with ~i5G-kDa type A, B, and E neurotoxins (lanes 5. 6. and 
7, respectively); lanes 9. 10, and 11 exhibit the heavy and 
light chains of types A. B, and E. respectively. (Type B and 
E neurotoxins were trypsinized to generate the dichain 
forms (261.) In lane 8 the 140-kDa band represents the 
residual amount of undigested baratii neurotoxin. Our lim- 
ited attempts to generate a homogeneous population of 



dichain protein by confining cleavage of the single-chain 
neurotoxin at only one peptide bond were not successful: 
additional cleavages occurred. The thin band immediately 
below the 140-kDa band is probably a fraction of the 
neurotoxin population that was cleaved near N or C termini 
but was not nicked. One band located between the heavy 
?nd light chain and the one below the light chain are products 
of cleavages at sites other than nicking. Similar additional 
fragmentations, in addition to nicking, have been observed 
with type B and E neurotoxins treated with trypsin and 
endoprr .einase Lys-C (20. 26). 

A sh rt stretch at one-third the distance from the N 
terminus of botulinum type A and E and tetanus neurotoxins 
is rich in Arg and Lys residues (1. 7. 12). Trypsin or the 
protease endogenous to the neurotoxin-producing bacteria 
cuts multiple bonds within a range of few residues. Homo- 
geneity of the N terminus of the heavy chain therefore 
depends on precise conditions of proteolysis and the end 
point of proteolysis U. 7, 12). The S6-kDa heavy and 47-kDa 
light chains of baratii neurotoxin produced by trypsinization 
(Fig. 3, lane 8) were heterogenous (SDS-PAGE showed 
more than a single band): therefore, we did not determine 
their N-terminal sequences. 

The 86-kDa fragment generated by trypsinization (Fig. 3) is. 
we believe, the heavy chain segment of the neurotoxin. After 
verifications of its identity, this material could provide inter- 
esting opportunities in the structure-function relationship 
studies. Electrophoretic migration (Fig. 3) indicates that 
baratii neurotoxin heavy chain is smaller in molecular mass 
than those of types A. B. and E, while the baratii neurotoxin 
light chain is smaller than that of type A. similar to that of type 
B, and larger than that of type E. The heavy chain appears to 
have at least three roles in the neurotoxin s pathway of 
intoxication, (i) In the 150-kDa dichain neurotoxin the -50- 
kDa light chain remains linked to the N-terminal half of the 
heavy chain: unless the heavy chain delivers it into the 
cytosol. the light chain cannot poison the neurotransmitter 
secretory cells, (it) The C-termina! half of the heavy chain is 
thought to have the binding vte for receptors on the target cell 
membrane (ft, 19). (iii) After receptor-mediated endocylosis. 
the neurotoxin, inside the cell, initially remains encapsulated 
by the endosomes. The N-terminai half of the heavy chain 
forms channels in the endwsomaf irienibrane through which 
the light chain exits into the cytosol (see reference 6 and 
references therein). The baratii neurotoxin heavy chain is 
smaller than the heavy chains of type A, B. and E neurotox- 
ins: therefore, its struriure-function analysis would be highly 
instructive in defining which precise segments of the heavy 
chain play what role. 
The baratii neurotoxin purified from 4-day-oM cultu/e was 
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in the single-chain form; this feature differs from the dichain 
type F neurot xin is lated from 4- 1 5-day-oId culture of 
proteolytic strains f C. botulinum type F (10, 33, 34). 

The mechanisms by which C. baratii (strain 6341) acquires 
the neur toxin gen? and expresses the pr tein are unl.;n v .-»n. 
The amino acid sequence reported here can be exploited to 
construct DN A probes which could be valuable in answering 
some of the genetic questions regarding the location of the 
neurotoxin gene and its exchange or acquisition by the 
nonneurotoxic C. baratii and toxigenic C. botulinum. 
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SUMMARY:Recently, it has been shown that two Clostridium butyricum strains 
(ATCC 43181 and ATCC 43755), isolated from cases of infant botulism, produce a 
botulinal neurotoxin type E (BoNT/E). Here we have determined the nucleotide 
seq : ,-3nces of the BoNT/E genes of these two C. butyricum strains and from C. 
botulinum E strain Beluga. We show that the sequences of the BoNT/E genes from 
the two C. butyricum strains are identical and differ in only 64 positions resulting in 
39 amino acid changes (97% identity at the amino acid level) from that derived from 
C. botulinum. Our data suggest a transfer of the BoNT/E gene from C. botulinum to 
the originally nontoxigenic C. butyricum strains. © 1992 Academic Press, Inc. 



The clostridial neurotoxins are highly potent protein toxins that inhibit 
neurotransmitter release at various synapses. These neurotoxins consist of tetanus 
toxin (TeTx) and seven serologically distinct botulinal neurotoxins designated 
BoNT/A, B, C1, D, E, F, and G, all of which are both structurally and functionally 
closely related. All of them are synthesized as single chain polypeptides of about 
15u kDa which are. proteolytically activated into di-chain derivatives constituted of a 
light (L) (Mr ap. 50 000), and a heavy (H) chain (Mr ap. 100 0000) linked by a 
single disulfide bridge (1). 
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A three step model has been proposed to explain the development of 
neurotoxicity (2). This model involves binding, internalization and intraneural 
sorting, and the actual poisoning of the nerve terminal. The heavy chains med.ate 
the binding of toxins to cell receptors, while the light chains appear to enter targe! 
cells and cause some internal changes (3,4). It is not known how the different heavy 
chams reach and affect different kinds of synapses, nor what is the molecular nature 
of the change caused by the light chains. However, these questions may be 
approached by sequence analyses and the identification of conserved doma.ns. The 
genes encoding TeTx, BoNT/A, C1 , and D have been sequenced previously (5, 6 7. 
8 9) It has been shown that the BoNT/E gene is chromosomally located .n C 
botulinum E strain Beluga, and a partial sequence was established by domng and 
sequencing of an EcoRI fragment (7). 

Recently, it has been reported that Clostridium strains quite d.fferent from C. 
botulinum can produce BoNT. Thus, two Clostridium strains isolated from infan 
botulism, and identified as C. outyricum were shown to synthes.ze synthesize 
BoNT/E (10, 11). A comparison of purified BoNT/E derived from C. botu„num^ 
outyricum showed that both toxins are very similar. Their molecular we.ghts (145 
000 Da) determined by polyacrylamide gel electrophoresis were in good agreement 
with that calculated from the deduced amino acid sequences. In addition, both toxns 
exhibited similar specific toxicity in mice (1 2). 

Here we have determined the complete nucleotide sequences of the BoNT/E 
genes from C. Botulinum E strain Beluga, and from the toxigenic C outyricu* 
strains ATCC 43181 and ATCC 43755, and we present the deduced ammo aod 
sequences. 

MATERIALS AND METHODS 

Baca,,*! DNA an* ^^^^^V^'SS^lX 

a bulyriaw strains ATCC 43181 and ATCC 43755 receiver i o y 

Escherichia coli strain TG1 . 

sperm DNA.in 1 M NaCI, 10% dextran sulfate,. 0.5 /„ SDS 50_mM j n,s ,y> 

mixture overnight at 40°C. Filters were wdbnw 
two hours and exposed overnight to Fuji RX films. 
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u rflQ Minn amDlification.C. butyricum DNA (100ng) was 
polymerase chain and primers deduced from the 

Sified using , the polym ase can rea .on (^R) ^ ^ done jn a ^ 

*°Tr \ 00 ^ SlP 10 mM beta-mercaptoethano., 25 pmol of each primer, 
°- 1% o t J of Taa polymerase (Beckman, Paris, France). Reaction mixtures were 
and 2 C V* at 95°C for 2 min and then submitted to 30 subsequent cycles 
d enaturated a : 95 0 to r * mm annealing (20s at hybridization 

^ nS,Stl ?VwhK me?ting temperature of the 

temp erature which [^^ ^^ DNA Ther mal Cycler version 2.2 (Perking 

r:et^ products were purified by 

Geneclean (Bio 101 Inc., La Jolla, USA) and sequenced. 

^miflr taioloaical techniques.Ligation reactions and preparation of 
0ther as described by Maniatis at at (14 ) 

plasma DNA trom t. con we e . T4 polynucleotide kinase and calf 

intestinal phospnatase were iruiu a seauenced by the dideoxy-cham- 

zz^s^X^™ ^ s,a,es Blochemical 

Corporation, Cleveland, USA). 

RESULTS AND DISCUSSION 

We used a synthetic 42mer oligonucleotide representing the nucleotide 
sequence trom positions 716 to 757 of the previously published EcoR. fragment o 
the BoNT/E gene to identify and Cone an overlapping 1373 bp Ns.l fragment. Th.s 
t J then used to clone a Sea. fragment (1639 bp) to yield pMR^ an 
with the help of the latter Cone, pMRP46 containing an Hind.ll fragment of 250 bp 
was obtained. Together, these four Cones encompass the entire coding reg.on of the 
BoNT/E gene from C. botulinum E strain Beluga. The nucleotide »1 
bp) .as been entered in the EMBL database under the access.cn number X62089. U 
contains a sing.e open reading frame of 3753 nucleotides. The deduced ammo ac.d 
sequence (1 251 residues) is shown in Fig. 1 . 

Overlapping DNA fragments of 400 to 500 bp were amplified from C. 
butyricum strains ATCC 43181 and ATCC 43755 DNA by PGR using 
oligonucleotides the sequences of which were deduced from the C. botulinum E 
Br 'IT/E gene. The nucleotide sequences of the BoNT/E genes from the two C 
toyricum strains were determined to be identical and have been entered .n the 
EMBL database under the accession number X62088. 

A comparison of the DNA sequences established for the BoNT/E genes from 
the C botulinum and the C. butyricum strains revealed differences in 69 portions. 
64 of them in the coding region and 5 in the 5" noncoding region. Thirty n.ne ammo- 
acid changes were detected, 19 of them (48%) being located withm the 198 N- 
t ,minal residues which represent 16% of the entire sequence. Amino** dent-t.es 
between C. botulinum E and C. butyricum BoNT/E sequences are 95% for the L 
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- - T N - - - 

31KSFNIMKNIWIIPERNVIGTTPQDFHPPTS 

61LKNGDSSYYD.PNYLQSDEEKDRFLKIVTKI 

91FNRINNNLSGGILLEELSKANPYLGNDNTP 

D R 

121 DNQFHIGDASAVEIKFSNGSQDILLPNVI I 

- G D -■ I - N ? - Q S 

151 MGAEPDLFETNSSNI SLRNNYMPSNHRFGS 

181IAIVTFSPEY5FRFNDNCMNEFIQDPALTL 
211 MHELIHSLHGLYGAKG1TTRYTITQKQNP, L 
241 ITNIRGTNIEEFLTFGGTDLNI ITSAQSND 
271 IYTNLLADYKKIASKLSKVQVSNPLLNPYK 
301DVFEAKYGLDKDASGIYSVNINRFNDIFKK 
331 L YSFTEFDLRTKFQVKCRQTY IGQYKYFKL 
361 SNLLNDSIYN I SEGYNINNLKVNFRGQNAN 
391 LNPRI ITPITGRGLVKKI IRFCKNIVSVKG 
421IRKSICIEINNGELFFVASENSYNDDNINT 
■ 451 PKE1DDTVTSNNNYENDLDQV I LNFNSESA 
481PGLSDEKLNLT1QNDAYIPKYDSNGTSDIE 
511 QHDVNELNVFFYLDAQKVPEGENNVNLTSS 
541 I DTALLEQPKIYTFFSSEFINNVNKPVQAA 
- 571 LFVSW I QQVLVDFTTEANQKSTVDKI AD I S 
601 1VVPY IGLALN 1GNEAQKGNFKDALELLGA 
631 GI LLEFEPELL1 PTI LVFTIKSFLGSSDNK 
661NKVIKAINNALKERDEKWKEVYSF1VSNWM 
691 TKINTQFNKRKEQMYQALQNQVNAIKT1 IE 

721 SKYN SYTLEEKNELTNKY D I KQ I ENELNQR 
________ - E ---- 

Fig. 1 . Alignment of the deduced amino-acid sequences of BoNT/E derived from 
C. botulinum E strain Beluga (upper line), and C. butyricum strains ATCC 43181 o 
ATCC 43755 {lower. line). Dashes represent identical amino acids. 
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781 KLREYDENVKTYLLNY I IQHGSI LGESQQE 

811 LNSMVTDTLNNS I PFKLSSYTDDR I LI SYF 

841NKFFKR I KSSSVLNMRYKNDKYVDTSGYDS 

871 NININGDVYKYPTNKNQFGIYNDKLSEVNI 

901 SQNDYI IYDNKYKNFSISFWVRIPNYDNKI 

931VNVNNEYTI INCMRDNNSGWKVSLNHNEI I 

961WT. FEDNRG I NQKLAF'NYGNANGI SDY I NKW 
- - L Q - - S - - -- 

991 IFVTITNDRLGDSKLYINGNLIDQKSILNL 
1021 GN IHVSDN I LFKIVNCSYTRYIGIRYFNIF 
1051 DKELDETE I QTLYSNEPNTN.I LKDFWGNYL 
1081 LYDKEYYLLNVLKPNNFIDRRKDSTLS INN 
1111 I RST I LLANRLYSG IKVKIQRVNNSSTNDN 
1141 LVRKNDQVYINFVASKTHLFPLYADTATTN 
1171 KEKT I K I SSSGNRFNQVVVMNSVGNCTMNF 

1201 KNNNGNN IGLLGFKADTVVASTWYYTHMRD 

1231 HTNSNGCFWNFI SEEHGWQEK 1251 
N-----F-- _______ 

Fig. 1 - continued 



chain, and 98% for the H chain. The molecular weights of the predicted 
polypeptides are 143 836.70 Da for BoNT/E from C. botulinum E, and 143 389.33 
Da for BoNT/E from the C. butyricum strains, respectively. 

The deduced amino-acid sequences are in close agreement with partial 
sequences determined for the N-termini of the L and the H chains of BoNT/E from 
C. botulinum and C. butyricum (1). As shown by Das Gupta (1), the H chain of 
BoNT/E from C. botulinum E and C. butyricum begins at Lys in position 423. Thus, 
the Cys residues located at positions 412 and 426 are probably involved in the 
disulfide bridge between the L and H chains. 

Fujii et al. have previously reported the nucleotide sequences of the EcoRI 
fragment encoding the 5'-terminus of the BoTN/E gene from C. botulinum E strains 
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Mashike, .wanai, and Otaru. and from C. butyricum strain BL6340 (16. 17). Several 
differences were found between these nucleotide sequences and the ™»>P°^0 
sequences of C. botulinum E strain Beluga and C. butyricum strains ATCC =43181 
and ATCC 43755. As reported in this study, the Beluga strain produces a BoNT/E 
that contains an Arg at position 177, Cys at 198, and Lys at 230. The Japan* , 
BoNT/E reference strains contained a neurotoxin with Gly at posit.on 177, Ser at 
198, and Met at position 230. The presence of Met in this latter position was also 
detected in BoNT/E from the C. butyricum strain BL6340 (17). Taken together, these 
Undings indicate that botulinal neurotoxins, although belonging to the same 
toxinotype, may exhibit minor differences in their amino acid sequences. 

BoNT/E (1251 residues) is significantly shorter than TeTx (1315 residues), 
BoNT/A (1296 residues), BoNT/C1 ( 1291 residues) and BoNT/D (1276 residues). 
the amino-acid level, BoNT/E from C. botulinum has an overa.l identity of 38.3% w,th 
TeTx 44 0% with BoNT/A, 38.3% with BoNT/C1 , and 32.0% with BoNT/D. 

' Functional domains which constitute the receptor binding sites on the H 
chains or which are involved in the as yet unspecified toxification process med.ated 
by the L chains are expected to be conserved. An alignment of the ammo acd 
sequences of the L chains of TeTx, BoNT/A, B. C1, D. and E revealed that mdeed 
highly conserved domains are separated from each other by short variable regies 
composed of 10 to 30 unrelated residues (18 ). In this respect, it is noteworthy that 
only 1 out of the 19 amino acid exchanges found in the N-terminal portions of the 
two BoNT/E L chain sequences involves a residue that is conserved in the other 
neurotoxins, while 10 of the replacements reside in such variable reg.ons. It ,s 
possible that these regions define the immunologic differences of the ind.v.dua 
toxinotypes. This would explain why only five out of nine monoclonal ant.bod.es 
raised against BoNT/E from C. botulinum E reacted with the neurotoxin from the 

butyricum strains (19). 

The His-rich motif in the center of the BoNT/E L chain involving His a 
positions 212. 216. and 219, is conserved in the sequences of all clostrid.ai 
neurotoxins analysed so far (7). It remains to be shown, wether this motif is directe y 
involved in the toxification reaction or in the translocation process of the L chain into 
the cytosole. Alternatively, this motif could merely serve as an element that stabfe* 
the tertiary structures of the individual L chains (18 ). 

The high level of identity between BoNT/E from C. botulinum and C- 
butyricum suggests a transfer of the BoNT/E gene from C. botulinum to ong.naU 
nontoxigenic C. butyricum strains, and an independent evolution from each other by 
point mutations. A similar unexpected synthesis of BoNT/F in C. baratii has been 
reported previously (11, 20). These findings underscore the general observation h 
classification of C. botulinum strains on the basis, of their neurotoxins bears M 
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1. SUMMARY 

Primers designed to conserved regions of bo- 
tulinum and tetanus clostridial toxins were used 
to amplify DNA fragments from non-proteolytic 
Clostridium botulinum type F (202F) DNA using 
polymerase chain reaction technology. The frag- 
ments were cloned and the complete nucleotide 
sequence of the gene encoding type F toxin de- 
termined. Analysis of the nucleotide sequence 
demonstrated the presence of an open frame 
encoding a protein of 1274 amino acids, similar to 
other botulinum neurotoxins. Upstream of the 
toxin gene is the end of an open reading frame 
which encodes the C-terminus of a protein with 
homology to non-toxic-non-hemagglutinin com- 
ponent of type C progenitor toxin. 



Correspondence to: D.E. Thompson, Department of Health, 
Eileen House, 80-94 Newington Causeway, London, SE1 6EF, 
UK. 



2. INTRODUCTION 

Within the genus Clostridium, organisms pro- 
ducing protein toxins able to cause paralysis 
symptomatic of botulism are deemed to be 
Clostridium botulinum. Outbreaks of botulism in 
humans and animals have resulted in the classifi- 
cation of the neurotoxin into seven serologically 
different forms identified as type A (BoNT/A) to 
type G (BoNT/G). BoNT/A, BoNT/B and 
BoNT/E are the main causative agents of human 
botulism, with rare incidents attributed to 
BoNT/F and BoNT/G [1]. The disease is most 
often the result of ingestion of toxin formed in 
foods that have been inadequately processed 
and/or stored at inappropriate temperatures. 
Another source of the disease is after coloniza- 
tion of the gut by Clostridia in infants of less than 
approx. 6 months, consequently known as infant 
botulism [1,2]. Occasionally botulism is the result 
of infection of a wound. There have been reports 
of BoNT being produced by Clostridia that 
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phcnotypically arc not C botulinum viz. C. bu- 
tyricum and C. barati [1,2], Type F neurotoxin is 
produced by C. botulinum Group I (proteolytic 
strains), Group II (non-proteolytic strains) and by 
C barati [3], 

Comparison of the complete amino acid se- 
quences from BoNT genes (BoNT/A [4,5], 
BoNT/C [6], BoNT/D [7], BoNT/E [8] and 
BoNT/E of C. butyricum [9]), and partial se- 
quence (BoNT/B [10]), with tetanus toxin (TeTx) 
[11] has revealed conserved regions thought to be 
implicated in toxin function [10], Primers based 
on data from these conserved regions were used 
to clone and sequence the gene encoding 
BoNT/F from non-proteolytic C botulinum us- 
ing polymerase chain reaction (PCR). We report 
here the nucleotide sequence of a gene, which 
when translated, gives an amino acid sequence 
similar to those of BoNTs already reported [5-10]. 



3. MATERIALS AND METHODS 

3. 1. Cultures and preparation of DNA 

C. botulinum strain 202F (ATCC 23387) was 
grown anaerobically in TPYCG (trypticase 50 g/1, 
bactopeptone 5 g/1, yeast extract 3 g/1, cysteine 
HC1 0.5 g/1 and glucose 5 g/1) broth and DNA 
prepared as described by Farrow et al. [12]. All 
manipulations of C. botulinum and cloned frag- 
ments were carried out under GMP II conditions. 
E. coli SURE (Stratagene) was used as the recipi- 
ent for transformation. Plasm id DNA was pre- 
pared from E. coli by the alkaline lysis method of 
Bimboim and Doly (see ref. 13) and for sequenc- 
ing, by the method of Treisman as described by 
Sambrook et al. [13], 

3.2. PCR amplification 

PCR was performed with a Biometra thcrmo- 
cycler using the following conditions: 95°C for 5 
min, followed by 25 cycles of: 94°C for 45 s, 
between 37 and 45°C for 45 s and 58°C for 2 min, 
followed by 62°C for 10 min and then 4°C. Primers 
were made on an Applied Biosystems model 391 
DNA synthesiser and used at a final concentra- 
tion of 4 ng/jul. Deoxynucleotides (BCD were 
present at a final concentration of 200 mM and 1 



unit Taq polymerase (Amersham International) 
was used according to the manufacturers instruc- 
tions. Template DNA was at a final concentra- 
tion of 20 ng/jii except for 'inverse PCR T when 
the concentration was 50 ng/jul of restriction 
enzyme-digested and re-ligated DNA. For 'in- 
verse PCR' [14], DNA was prepared using restric- 
tion enzymes (BCD according to the manufac- 
turer's instructions and ligations performed using 
a 'Ligation System' kit (Amersham International). 



3.3. Cloning and transformation 

Products of PCR were cloned into the EcoRV 
site of pBluescript KS + (Stratagene) following 
modification and purification of the fragment by 
Klenow (BCD, 'Geneclean* (Stratatech), kinase 
(Gibco-BRL), phenol extraction and ethanol pre- 
cipitation [13]. An alternative vector system was 
used to construct pCBF2 and pCBF8; DNA frag- 
ments generated by PCR were cloned directly, 
without enzymic modification, into 'T-vector' (D. 
Evans, University of Reading, personal communi- 
cation) digested with Xcm\ (New England Bio- 
labs). This digestion generates a single 3' over- 
hanging T on the vector which base pairs with the 
A, often added to the 3' end of PCR fragments 
during the reaction [15]. Transformation into E. 
coli competent cells, prepared as described by 
Inoue et al. [16], was carried out as described by 
Sambrook et al. [13]. Plasmids containing inserts 
of the expected size were sequenced following 
testing of the recombinant-cell lysates by the 
mouse bioassay [17], to ensure that no toxic 
polypeptides were produced. 



3.4. Sequence determination 

Sequencing of plasmid DNA was carried out 
using dideoxy-nucleotide chain termination. 
Primers complementary to both the plasmid and 
toxin were used to sequence both strands of each 
insert completely. As the inserts were generated 
using PCR, the sequence of two fragments syn- 
thesised in different PCR reactions was deter- 
mined. Where two sequences differed, as they did 
in three positions, a third clone was sequenced 
and in each case confirmed one of the sequences. 
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4. RESULTS AND DISCUSSION 

To ensure that no toxic polypeptides were 
produced only fragments encoding part of 
BoNT/F were cloned in E. coli. Figure 1 shows 
the position of the primers used to amplify re- 
gions of DNA which were subsequently cloned. 
No recombinant plasmid was toxic in the mouse 
bioassay [17]. 

The open reading frame codes for a protein of 
1274 amino acids with a molecular mass of 
146708. This calculated value compares well with 
the measured value of 150 000 for BoNT/F puri- 
fied from proteolytic C botulinum [18]. The pro- 
tein encoded by BoNT/F shows high homology 
with TeTx and other BoNTs. Since these toxins 
are usually cleaved in vivo and as the two chains 
have different functions in toxic activity [10], the 
proteins were divided into light and heavy chains 
for comparative purposes. Comparing BoNT/F 
with light and heavy chains for toxins where the 
sequence is known (data not shown), the light 
chain of BoNT/F is more like that of BoNT/E, 
showing 58.3% and 57.6% identity with the toxins 
of C. botulinum and C. butyriewn respectively, 
than TeTx (45.1% identity) and BoNT/A (35.1% 
identity). The heavy chain also showed greater 
homology with the BoNT/E from C. botulinum 
(64.9% identity) and C. butyricum (64A% iden- 
tity), with BoNT/A and TeTx having identities 
44.6% and 35.0% respectively. The high sequence 
identity of BoNT/F and BoNT/E may explain 



the cross-reactivity of antisera raised against ci- 
ther type toxin with both BoNT/F and BoNT/E. 

Within the toxin sequence there are regions of 
highly conserved amino acids thought to be im- 
portant for structure and function (see ref. 10). In 
the light chain, the region noted by Niemann, 
DPhhnLhHELnHnnHxLYG, where h is a hy- 
drophobic residue, n is uncharged and x is any 
amino acid [10], is also present in BoNT/F (Fig. 
2, residues 220-239). The cysteine residues in- 
volved in disulphide linkage of the heavy and 
light chains are found in all the toxins (Fig. 2, 
residues 429 and 445). The number of residues 
between the cysteines varies between eight 
(BoNT/B) and 27 (TeTx); BoNT/F has 15 amino 
acids. As with BoNT/E, this BoNT/F is encoded 
by a non-proteolytic C. botulinum , it is therefore 
unlikely to undergo proteolytic cleavage by an 
endogenous protease, but a possible cleavage site 
for exogenous proteases is between residues 439 
and 440 (Fig. 2). The C-terminus of the heavy 
chain, a region noted for its lack of homology in 
BoNTs [10], is very well conserved between 
BoNT/F and BoNT/E. It has been suggested 
that this region is involved in the binding of 
BoNT to a gangliosidc receptor [10]. The high 
homology between BoNT/E and BoNT/F may 
indicate that they share a common receptor. 
Analysis of sequences upstream of BoNT/F re- 
vealed a possible Shine-Dalgarno sequence 5'- 
AGGGGG-3' (Fig. 2, nucleotides 132-137), eight 
nucleotides upstream of the proposed start of 
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Fig. 1. Strategy for cloning fragments of the BoNT/F gene. Arrows indicate the position of the primers used to generate the 
fragments. Clones generated by 'inverse PCR' are indicated by *; in the case of pCBFl*/pCBF2* the chromosomal DNA was 
digested with 5aw3Al. for pCBF6* with EcoRI, and pCBF8*/pCBF9* with EcoRV, Only the sites of restriction enzymes 

involved in the construction of clones are shown. 
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translation. Further upstream, nucleotides 33-38 
and 47 (Fig. 2) are identical to those mapped as 
the - 10 and transcriptional start by Binz et al. 
for BoNT/A using primer extension [5]. The 
-35 region suggested by the same authors for 
BoNT/A of 5'-TTAACCC-3' is present virtually 
unchanged (5'-TTAACGC-3') in the same posi- 
tion in BoNT/F (Fig. 2, nucleotides 5-11). How- 
ever, further analysis of this upstream region of 
the BoNT/F gene shows that it contains the 
3 '-end of an open reading frame (Fig. 2, nu- 



cleotides 1-130 and data not shown), translation 
of which into protein sequence gives the C-termi- 
nal 57 amino acids (Fig. 3). The translated se- 
quence shows homology with the non-toxic-non- 
haemagglutinin component of C. botulinum type 
C [19], the gene for which is found in the same 
position, upstream of the toxin gene. Published 
sequences reveal the presence of such open read- 
ing frames in type A [5], type E [8] and C. 
butyricum [9], although not commented upon by 
the authors. As these translated open reading 



ATAAAAACAGQ 
I K T 

SAATCTGAAT* 
E 5 t 1 

ATGAAAAATAl 



AATGAAGCCM 
N E A 



AACrATATTC 

x r : 

GTGAC7ACVrA< 
V T S 1 

ATTCTAATTAl 
! L I 

ATGCGATATGj 
M 3 Y 



TTCTTTAACGCATGCTCATAAr.MTAA^TATATATGTTTA-CrArGAAA^ATGAGAArTA 6C 
TAATTG3ATGATATGTAACAA?GAAAn~AACATAfTT AAAAAGGCA7AT 7TGTGGA7A7T ) ? 0 
GA.AAGAAG7ATAG^GGGATTTTTATGCCAG7TGCAATAAATACTT TTAATTATAATfiAr: 150 



* pnmer 2M> 
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? V N 0 D T I 3. 7 M 0 I r Y E E K S K K 
• » primer 16* 
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S=kv3AI 

CGAACGAATCCTAGrGATTrTGATCCAC-GGCrTCATlAAAGAACoCJAAGCAoTGCTTAT 



T N 



C F D P P A 



(32) 
240 



3F.3 
P2) 



TATGATCCTAA7TAT7TAA0CACTGA7GC7GAAAAAGATAGATATTTAAAAA~AA.CfiATA 4 ? 0 

YDPNYLTTSASKCRYLKTTT 02} 

AAATT AT TT AAGAGAATT A ATAGT A ATC CTGCA GGG AAAGTTTTGT 7 AC A A G AAAT AT - A 480 

K L F K K 1 tJ 5 N ? A G Y. V L !. 0 E I S (H2> 

TATG CT AA ACCATATTTAGGAAATGACCACTACGCC AATTilATGAAT 7C TC7CCAGTTACT 540 

Y A K P Y I G N B H T F I D E F .S ? V T (1321 

AG AACT ACAAGTGT 7 AATAT AAAAT T A7CAACT AA7GT7G AAA.G T 7 C AATCT T AT TG AAT S 00 



F T T S V N 



K L S T N 



CTTCTTGTATTGGGAGCAGGACC7GA7ATA'r?TGAAAGT'rG'iTGTrACCCCG7TAGAAAA 66C 
L L V ;, G A G P D I FES C C Y ? V R K (1721 
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IDPDVVYDPS 



F G S IK 



G7GACATT7TC ACCTGAGTATGAAT AT ACTTTTAATGATA'rr AG7GGA:^.X: ATAA7AGT '/ S 0 
VTFSPEYSYTKN3IS5GH.N5 (217) 
* ' * * * * * primer 3 

agtacagaa7r:atrra7tgcagat:c7gcaatt tcac7agc7catgaa7tgatacatg ca 34 v 
stesf:adpa:slahe:!.:ka (232) 

c 7gc atgg at t a7acggg3c 7 agggg ag 7 7 actt atg aag agac tat agaag taaagc aa sco 

LHGLYGAP. GVTYEET IEVKQ (252) 

GCAC C T C 7T ATG AT AGCCG AAAAAC CC AT A AGGC 7 A^AAGAATTT T 7 A ACC TTTGGAGGT 960 
A ? L M I AEKPIRLEEF.l'tFGO {272! 

CAGG ATTT AAAT ATTA TT ACTAGTGC 7 ATGAAGG AAAAAAT AT AT AACAA7CTT TT AGCT 1 02 C 
ODL NI I 7 3 A K K E K - YNMLtA (292) 

AAC7A7GAAAAAA7AGCTAC7AGACTTAGTGAAG7TAA7AGTGCTCCTCCTGAATA7GAT 1 080 
KY'E.KIATRLSEVNSAPPEYC (312) 

AT7AATGAA7ATAMGATTATTT7CAATGGAJVGTA:-G«iCTAGATAAAAATGCTGATGGA 11 4 C 
I NEYK DY F Q K K YGI DKNADG (132) 

AGTT AT ACTGT AAATGAAA AT AAATTTAATGAAATTT AT AAAAAATT ATA TAG7T TT AC A 1200 
SYTVJiES KFN'KIYKKLYSFT (352) 



GAGAG7GACTT AGCAAATAAAT7T AAAGT AAAATGTAGAAATACT7 ATTTTATTAAAT AT 1260 

E S 0 ". A N K F K V K 3 t) T Y F I K '? (372) 

G.AAT"TTTAAAAGT7CCAAAT7TG7TAGATGATGATATTT AT ACTGT ATCAGAGGGGT7T 1320 
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I 3 S I ? i) K CJ I, V E K I V K F C .K S V (4 32) 

ATTCCTAGAAAAGGTACAAAGGCGCCACCGCGACTATCiC:A"'rA(;AC;7AAATAATA0TGAC 1=00 

: ? r k g 7 k | a ?• « .4 g : p. v r; n s f {<52) 

TTA7TTTr"r(iTAGCTTCAGAAAGTAGCTATAATGAAAATGATATTAATACACTTAAAGAA 1560 



A TTGACGATACAACAAATCTAAATAATAATTATAGAAATAATTTAGATGAA-G TTATT t:a : 62 0 

I D D T T S !, N N' S Y P. N K L D E V I I (492) 

GATTATAATAG-rCAGAT.V.TACCTC-WJVTATCAAATCGAACATTAAATACACTTGTACAA 1SS0 

OYKS0TIPQI5.1P. TLNTLVC (3l2i 

GACAATAGTTATGTG'XAAGATATGATTCTAATGGAATAAGTGAAATAGAGGAAT ATGAT L ^40 

^ N S Y V ? S Y D S M G 7 ' S E ! F £ YD <i32> 

G rTG7TGACTT7AATGTAT7TT7CTAT7TACATGCACAAAAAGTGCCAGAAGGTGAAArr ] 300 

V V D F N V F F Y H A 0 K V ^ E G E 7 (!i52j 

MTATAAGTTTAAGTTGTTCAATrfJATACAGCATTATTAGAAGAATCCAAA.GATATATTT 1 8 SO 

S I S t. T 3 S I37AI, LEESKO I ,F 1^??) 

7777C7TCACAGT7TATCG ATACTATCAATAA.ACCTGT AAATGCAGCACTATTTATAGA7 1020 

F 3 S E F I D T I K K P V t: A A I. F I C (r.32) 

TGGATAA jCAAAG7AATAA , GAGAT77TAGr:ACTGAAGCTACACAAAAAAG7ACTG77GA7 1 9 S C 

WISKVIR3FTTEAT0KS TVS <612i 



AAGATTG7A"ACA7AT-*TTTAATTGTACCCrA7GTAGGTCr7GCTTTGAATATAATTATT 2040 
L I V ? 'f V G I* A I. N I I I (632! 



:< I a d i 



GAGGCAGAAAAAGGAAATTTTGAGGAGGCAT TTGAATTATTAGGAGTGGtf I'ATiTTATTA 2100 
E A E K G N F E E A F V. I, t, G V S I L L (652) 

GAATTTG7GCCAGAACTTACAATTCCTG7AATTTT.AGTGTTTACGA7AAAATCCTATA7A 2 : .60 
EFVPEL7IPVILVFTIKSYI (672) 
prnwt 230 
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DSYENKKF. AIXAINKSLIER (632) 

G AAGCAAAG TGG AAACiAAATAT AT AGTTGG A TACT ATCAAA TTGGCT T ACTAGAA TTAAT ?. ?90 
E A K W K E I Y S W I V 5 N X* L T R I ■ S ( 7121 

ACTCAATT7AA7AAAAGAAAAGAGCAAA7G7ATCAGGC7T7ACAAAATCAAGTAGATGCA 234 Z 
TQFNXFKSQKYOAI. 0 N O V IJ A ;732> 



Fig. 2. Complete nucleotide sequence of the BoNT/F gene. The translated amino acid sequence of BoNT/F is given under the 
second nucleotide of each codon. The position of primers used in PGR to generate the clones is indicated by underlining; * above a 
nucleotide indicates a mismatch between the primer and the DNA sequence. The possible position of cleavage between the light 
and heavy chains is indicated. The cysteine residues are underlined at amino acids 364 and 445. Nucleotides 1 to 130 form part of 
the open reading frame translated to give the sequence shown in Fig. 3. The sequence has been given the Genbank database 

accession number M92906. 
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A™AAAJJVCAGCAATA;;AA7A7AAATATAA7AATTATACrT(JA^A7GAriAAAAATAOAC""T 2403 
I K T A I E Y V Y S K V T 3 0 K K K B - ( '52) 

W.7CTCMTA7AATAa'AATAATATAGAAGMGAA7TyAA?AAAAAAGT7TCTTTAGnA 24 60 
"e'sEYNIMMIEF. F. LXKXVSIA 1772) 

ATGAAAAATATACAAAGATTTATGfiCAGAAAGTTCTATATCTTATTTAATGAAATTAATA 2520 
M K N I K R F T E S « I S Y L M K I. I (752) 

AATf : AAGCC AAAGT 7GG7 AA ATT AAAA A AA7 A 7G ATAACC AT GT 7 A AG ACCOATTT A 7? A 2 iBO 
NEAKVCKLKKYTM-iVKSPLl •' ^ 1 ^ > 

AArTArATTCTCCiACCATAOATCAATCTTAGGAGA^AGACAAATGAAnAAin-^ATT TG 2 6-1 C 
NY* i D K V. S ' L G E C' T N E L S t L i&32) 

GTGACTAGTACTT7GAATftGTAGTATTCCA7T7C:AACri-rf:TTCA7ATACTAA?GATAAA .2700 
V75TLNSSIFrEL£5YTS3K (:>52; 

AT-CTAATTATATA7TT?AATAf;AT7ATATAAAAAAATTAMGA7AGTT^TATT7TAGAT 2 7 &0 
: j, t I Y F N R L Y K :< I K D S 3 ! 0 (872) 

A^GCGATATGAAAATAATAAATTTATAGATATC rCTOGAT A7G(JVi'CAAATA7AA.~£ ATT 232C 
H S Y E H H k r : c : s « if 3 S K I S I (8?2> 

AATG GAAACG T AT ATA7TT A TTC AA C AAAT AGAA/ i7C AAT TTCXiAATATATAATAGTAGG 2853 
NT. HVYlYSTNHNa-0 : ' S S R (41^1 

■" T7 A GTGAAGT? AAT ATAOCTC AAAAT A ATGATA7T AT A T AC AA TAGTAGA T ATC AAAAT 1 ?<S 0 
1 S E V S : A C N S C I I Y H S R Y C N (932) 



rr7AGTA7TAGT7TCTG^TAAGGAlTCC7i\AACACTACAAACCTATGAA7CATAA7fM 
FSISFWVRIPKHYKPMNHNi 



(352) 



GAATACACTA7AATAAATTG7ATriaX«\A7AATAAT7C:X^TG<lAAAATATCAC77AGA 306C 
V. Y T 1 I V C M N N N S G W K I S L R (37;.) 
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T V R D C E I : H T L 3 0 T S G K X E K (392) 

"■"T AATTT 7T AGGTA7G AAG AACTTAAT AGGATAT CTAAT T A 7 ATA.AAT AAATCiGAJTTTT 3 1 8 Z 

"l:?ryeelnrismy:mk«i f (iDi2> 

GTAACTAjTTA^TAAT AAT AGATT AGGC AAT? CT AGAAT7 T AC ATC AATG( JAA ATT7 AA T A 3240 
~V 7 I TNNRLGM2RIYINGJJLI (13*2) 

primct 400 

."TTGAAAAATCAAT wrr.*^TA^rrr.aTAT7CATGT?AyrSATAATATA7TATTTAAA 3300 
VEKS13KLGD:HVSDMILFK CC52) 
« « priwiei 229 

ai^^TTr.-n-rtrmftTr: ftTClAAA CGT ATGTTGGTATAAGA T AT TTT AAAG77TT7AATACG 3160 
"T~v""g~"c DDKTYVGIRYFKVrKT I1C72) 

nAATTafaTAAAACAGA AATTGAGACTTTA TATAG7AAT5AGCCAGA7CCAAGTA7^TTA 342C 
SUDKTEIST1. Y S N E F D F S I L (1092) 

AAAAACTATTGGGGAAATTATT TGCTATATAATAAAAAATA7TAT7TA77 t 7AA7TTACTA 3480 
K H Y W G N Y T" L Y N K K Y Y L F S L (1112) 



EccHI 

AGAAAAGAT AAG7ATATTACTC7GAA7TCAGGCATrT7A AATATTAATCAACAAA.GAGGT 3540 
RKCKYITLSSGriNlMCORG (1132) 

0TTAC7GAAGGCTC7CrrTT7T?7iJMCTA7AAA7TA7ATGAACGAGTAGAAGTCATTATA 3600 
V 7 E G S V F L X Y K L Y R G V E V I I (1152) 

AG AAAAAATGGTC CT A TAG A TATA 7C7AATAC AG AT AATTT TGTT Af JAAAA AACGATC 7 A 3 660 
R Y, -A 0 P I V. I S M T D K F V S K N D L ('172) 

?: A':-ACATTAATGTAG7AGATCGTGG?GTAGAATA7CGG?TATATGCTGA7ACAAAATCA 3723 
AYIS'/vaRGVEYRLYADTKS ill92) 

r^GAAA3AGAAAATAATAAGAACATCTAA7C7 AAACGA7 AGCTTAGSTCAAAT7ATAC77 ? 7 8 3 
£ K £ K I IRTSNLNSSLGQIIV (12:2) 

ATGGATTCAATAGGAAATAATTGCACAATCiAATTT'rCAAAACAATAATGGGAGCAATATA 3840 
» ;) £ I G H N C 7 K N F 0 S N K n 5 N I (1232) 

GG ATT ACT AGG T TTTi '. A'l 7t ' A AAT AATTTGGTT GCT AGT AGTTGGT AT T A7 AAC A AT A7 A 33C0 
C; L - G ? H 5 H M I- V A S S h Y i N N Z (1252) 

CGAAGAAATACTAGCAG7AATGGATGCT7TTGGAG7TCTATT7C7AAAGAGAATGGATGG IMZ 
BRHT3SN5CFWSSISKS NCK C2'2! 

AAAGAATG.ATAGGCCACAGTAAT7AATCTCAAACT;^ATGAG7C7GTCAA.GAATTT77GT 0 D?C 
K £ - • (1274) 

A7AAACATT^ATAAAAAATTTTAAAA7TAArATATTTAAGAATAGCTJv.GATATCAGTAT 4 3 SC 

GCTA7GTTAA7A7C7AXTATT7TAATTTATCCCATA7TATTA7AATAAGAAAAAATACT 4 1 40 

A T7 7 C 7 A7TGTAAGT AC AAG TTT AT7GGT ATC7CA T AAA TGA7 AC AAG ATATC 4193 



Fig. 2. (continued). 
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Fig. 3. Comparison of the amino acid sequence of the C- 
terminus of open reading frames upstream of BoNT. The 
protein encoded by type C has been described as non-toxic- 
non-hacmagglutinin component of the progenitor toxin [19]. 
For type A, type E and type E (but) the amino acid sequence 
was derived from published nucleotide sequences [5,8,9]. Type 
E (but) refers to type E of C butyricum. 



frames are homologous with the non-toxic-non- 
haemagglutinin component of BoNT/C (Fig. 3) it 
is possible that there is a toxin operon encoding 
the components of the progenitor botulinum toxin 
complex in types A and E and C. butyricum, as in 
type C [19]. 
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